30 open-source projects similar to solidspoon/dashplayer, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best DashPlayer alternative.
LLPlayer is a language learning media player and AI subtitle generator that integrates large language models for real-time audio transcription and translation. It functions as an LLM-integrated video player and SRT transcription tool, utilizing local or remote AI models to generate text subtitles from audio and video streams. The project distinguishes itself through a contextual translation workflow that sends preceding subtitle lines to language models to maintain conversational flow and sentence structure. It also includes an optical character recognition system to convert bitmap-based subt
LunaTranslator is a real-time translation tool designed for visual novels and games. It functions as a multi-engine translation hub and text extractor that captures dialogue via memory hooking or optical character recognition to convert it into a target language. The project distinguishes itself through specialized linguistic tools, including a Japanese text analyzer for sentence segmentation and phonetic readings. It also operates as a digital dictionary aggregator, querying multiple online and offline databases simultaneously to provide comprehensive vocabulary definitions for language lear
GoldenDict-ng is a multi-source dictionary application and offline dictionary reader that enables users to search for word definitions across local files, DICT servers, and web sources in a single interface. It functions as a web-based definition browser, rendering entries using a browser engine to support HTML, CSS, and JavaScript for rich content presentation. The project distinguishes itself by integrating with Anki flashcard systems to facilitate language learning workflows and offering specialized translation tools that support clipboard monitoring and character set conversion. It also p
This project is a systematic framework for English language acquisition that applies structured workflows and cognitive strategies to build linguistic proficiency. It focuses on the construction of a linguistic knowledge base, enabling learners to master vocabulary and grammar through methodical training. The methodology is distinguished by its use of computer science concepts, such as mental-model-based learning and memory buffers, to organize progression. It emphasizes a cognitive-translation bypass to develop target language thinking, reducing mental latency by processing information direc
MuJing is a contextual English vocabulary learner and interactive media player designed for language study. It extracts words from videos and documents to provide real-world examples and media clips for memorization, functioning as a subtitle-based language tool and a lemma-based word list generator. The system differentiates itself by linking vocabulary lists to specific video timestamps and subtitles for auditory and visual reinforcement. It includes a video player with bilingual subtitles and keyboard-based transcription and spelling exercises to build muscle memory through movie and telev
Bilive is a multimodal AI video pipeline and live stream recording tool designed to capture real-time broadcasts and automate the creation of highlight clips. It functions as a multi-platform stream orchestrator capable of distributing looped pre-recorded content and managing the automated upload of processed video clips to various destinations. The system distinguishes itself through AI-driven content generation, using comment density to detect high-energy segments and multimodal models to automatically produce descriptive titles and synchronized subtitles. It further utilizes image-to-image
Downkyi is a desktop application designed for the batch retrieval and archiving of high-resolution video content from Bilibili. It functions as a media downloader that automates the collection of digital libraries by managing concurrent download tasks through a thread-pooled queue. The application distinguishes itself by integrating a multimedia framework to perform post-download operations, including stream remuxing, audio extraction, and watermark removal. It utilizes a modular plugin architecture to decouple stream resolution logic from the core application, allowing for independent update
Pytube is a Python library and command line interface for downloading videos, playlists, and captions from YouTube. It functions as both a programmatic tool for metadata extraction and a standalone media downloader. The project is designed using only the Python standard library to avoid external package dependencies. It utilizes regular expression-based HTML parsing to extract stream URLs and asset details directly from the platform. The library supports retrieving video metadata and thumbnails, as well as extracting caption tracks. It provides capabilities for downloading entire playlists a
VidBee is a self-hosted media download manager that wraps the yt-dlp engine to download videos and audio from over 1000 websites. It functions as both a desktop client and a Fastify-based web service, managing downloads through a persistent queue with pause, resume, retry, and real-time progress tracking. It uses cookie-based authentication to access login-gated, age-restricted, or subscriber-only content by importing browser cookies or Netscape-format cookie files. The application distinguishes itself through automated download workflows, including RSS and Atom feed monitoring t
h5player is an HTML5 video player extension and web media controller that adds advanced playback controls, visual filters, and media downloading capabilities to any web page using the HTML5 video tag. It functions as a customizable media hotkey manager and real-time video filter tool to enhance the standard browser viewing experience. The project is distinguished by its configuration-driven extension system, which allows for the remapping of playback shortcuts and the addition of new features through external scripts. It also provides a real-time visual filtering suite for modifying brightnes
Han1meViewer is an Android media viewer application for browsing, streaming, and downloading media content from a specific external website. It functions as a privacy-focused media browser that adapts external site content to a mobile-optimized interface. The application features tools for bypassing network restrictions through proxy and CDN configuration. It provides privacy protections including application locks and launcher icon disguises to hide the application's purpose. The project covers a wide range of capabilities, including background video downloading for offline media management
This project is a Go library and command-line utility designed for the retrieval and local archival of remote video content. It provides a programmatic interface for fetching media streams, allowing users to extract metadata and download video files directly to local storage. The library distinguishes itself through its ability to resolve playback restrictions by performing algorithmic transformations on obfuscated authentication tokens. This signature decryption process enables the tool to bypass standard access limitations, while its interface-driven design allows for the selection of speci
VideoLingo is an automated video localization suite designed to transcribe, translate, and dub video content. It functions as a translation pipeline that utilizes large language models to convert spoken audio into precise text segments and translate them into multiple languages. The system differentiates itself through a multi-step translation refinement process and a specialized natural language processing utility that segments text into single-line captions meeting broadcast standards. It also integrates synthetic voiceover generation to replace or augment original audio tracks. The projec
ani-cli is a command line interface designed for searching, streaming, and downloading anime content. It functions as a terminal-based media utility that isolates and retrieves direct video streams from external web sources, allowing users to browse catalogs and play media without a web browser. The tool distinguishes itself through a modular provider architecture that uses pattern matching to extract metadata and stream links from various sites. It delegates actual video decoding and playback to external system media players, providing users with control over resolution, audio tracks, and pl
Piko is a collection of specialized Android utilities designed for binary patching, user interface customization, media downloading, and application debugging. It functions as an APK modification framework that alters application metadata and binary resources to modify internal logic and bypass version locks. The project differentiates itself through deep application modification capabilities, including bytecode-level instruction patching, DEX file manipulation, and a dynamic hooking engine. It enables the overriding of resource files and the injection of permissions via the application manif
This project provides an English dictionary REST API and a corresponding web browser extension. It serves as a programmatic interface for retrieving linguistic data, including word definitions, phonetic transcriptions, parts of speech, and synonyms via HTTP requests. The browser extension implements in-browser word lookup by injecting content scripts that listen for double-click events. When a user selects text, the tool renders a DOM-based overlay to display the word's meaning in a pop-up bubble at the cursor position. The system utilizes asynchronous network fetching and external API proxy
Autosub is a command-line media processor and automatic subtitle generator that converts audio streams from video and audio files into timed text overlays. It functions as an AI speech-to-text converter that uses OpenAI Whisper to generate synchronized subtitles. The tool includes a language translation pipeline to convert transcribed speech into target languages, enabling multilingual video captioning. It manages the process from audio-stream extraction to the serialization of final subtitle files for local storage. The system covers audio-to-text transcription, time-stamped text mapping, a
SmartSub is a cross-platform desktop application for AI-driven video transcription and subtitle generation. It converts audio and video files into text subtitles using local AI models and incorporates hardware acceleration to increase processing speed. The tool features a subtitle translator that uses large language models from providers such as OpenAI and DeepSeek to translate subtitles between languages. It includes a visual editor for proofreading and polishing transcribed text, paired with a video preview for frame-accurate synchronization. The software supports batch processing of multi
ECDICT is a collection of structured linguistic datasets and an English-Chinese dictionary database. It provides bilingual word definitions, phonetic symbols, and parts of speech, alongside a bilingual geographic gazetteer that maps English place names to Chinese equivalents. These resources are available as a multi-format lexicon export in CSV, SQL, StarDict, and MDX formats. The project distinguishes itself by integrating a linguistic corpus dataset that includes word frequency rankings and academic syllabus markers derived from national corpora. It functions as an educational vocabulary re
VideoCaptioner is an automated tool designed to generate and embed time-synchronized subtitles into video files. By leveraging speech recognition models, the software converts spoken audio into text and calculates precise timestamps to ensure captions align with the original media. The project operates as a local-first inference pipeline, performing all transcription tasks on the host machine to maintain data privacy. It utilizes a transformer-based neural network for speech recognition and integrates a multimedia framework to handle the technical aspects of video processing and subtitle stre
This project is a comprehensive Lisp AI implementation library that provides reference implementations for various artificial intelligence paradigms and symbolic algorithms. It functions as a multi-purpose toolkit containing a logic programming engine, a natural language processing suite, and a symbolic mathematics toolkit. The library is distinguished by its diverse architectural frameworks, including a Prolog-style execution engine that uses unification and goal-driven backtracking, and a system for simulating human decision-making through expert system shells and certainty factors. It also
This project is an AI video post-production suite that uses large language models and programmatic tools to automate editing, transcription, and subtitle generation. It functions as an AI editing agent that translates natural language instructions into shell commands, providing a programmatic interface for manipulating media via FFmpeg. The toolkit includes a motion graphics engine that generates technical animations and visual overlays through code-driven rendering and mathematical definitions. It distinguishes itself by combining an AI-powered transcriber for word-level timestamps with an a
This is a Windows application for automatic speech recognition that transcribes spoken audio from video files into timestamped SRT subtitle files. It serves as a subtitle generator and translation tool that converts media speech into synchronized text. The software functions as a batch media transcriber, allowing the simultaneous processing of multiple audio and video files to generate subtitles in bulk. It includes a translation workflow to convert transcriptions between different languages for the creation of bilingual or localized files. The system also provides text refinement capabiliti
NarratoAI is an automated video production pipeline that uses large language models to generate scripts, voiceovers, and edited video commentary. It functions as a combined scriptwriter, voiceover generator, and video editor to streamline the creation of movie and television commentary content. The system automates the production workflow by converting input data into structured narrative scripts, synthesizing artificial speech for narration, and programmatically assembling video clips based on script timestamps. It also converts spoken audio from video files into written text for subtitles a
Easydict is a macOS dictionary and translator application that integrates system dictionaries, external translation services, and large language model services such as OpenAI and Gemini. It functions as an OCR text extractor and a text-to-speech reader, allowing users to look up words and translate text directly on the desktop. The application features a local OCR engine that captures screen areas to recognize and translate text that cannot be highlighted or copied. It utilizes a provider-agnostic translation pipeline and adapter-based service integration to standardize responses from various cloud a
This is a Chinese natural language processing toolkit providing a suite of tools for word segmentation, part-of-speech tagging, and named entity recognition. It includes a neural dependency parser for analyzing syntactic and semantic relationships between words and a machine learning training suite for creating custom linguistic models using annotated datasets. The toolkit distinguishes itself through its deployment flexibility, offering a dockerized server and a web service interface that exposes processing capabilities via API. It supports the use of pretrained models and allows for the int
This project is a structured computer science educational course consisting of video lessons, curated playlists, and translated study materials. It delivers a comprehensive curriculum covering foundational computing principles, ranging from basic logic and hardware architecture to artificial intelligence. The project facilitates bilingual technical learning through dual-language video subtitles and translated learning materials. These resources, including knowledge maps and supplementary notes, are designed to help non-native English speakers acquire industry-standard technical terminology by
Auto-subs is an AI transcription and automatic captioning tool that converts spoken audio from video files into synchronized subtitles. It functions as a subtitle generator and a transcription bridge, enabling the conversion of speech to text with automatic speaker identification and multi-language translation support. The software prioritizes data privacy by utilizing on-device AI inference to process audio and video files locally on the user's hardware. It distinguishes itself by offering deep integration with professional video editing workflows, allowing users to export timing and transcr
Annie is a command-line video downloader and web video extraction library written in Go. It functions as a concurrent media downloader designed to fetch video files and playlists from websites via URLs. The tool distinguishes itself through a proxy-aware network layer that supports SOCKS5 and HTTP proxies to bypass regional content restrictions. It also incorporates session cookie integration and referrer spoofing to facilitate the download of authenticated or age-gated content. The project provides capabilities for bulk media acquisition, including batch downloading from text files and extr
This is a tool for downloading videos, images, and audio from the Douyin social media platform using shareable URLs or profile links. It can download individual posts, entire user profiles including all posts and liked content, collections, and music tracks, with options for watermark-free and high-quality output. The tool also supports live stream recording, comment collection, and keyword-based content search with JSONL export. The project distinguishes itself through an integrated REST API server that accepts download and transcription requests, tracks job status, and exposes health check