Legal:
Dev Stack:
Server Stack:
A powerful and intuitive interface for transcribing audio and video files using OpenAI's advanced AI models. Transform your recordings into text and generate various outputs like summaries, meeting notes, and more.
- Audio/Video Transcription: Convert speech to text from various audio and video formats
- Multiple Output Formats: Generate transcriptions, summaries, meeting notes, and more
- Language Support: Transcribe content in multiple languages
- Translation: Translate transcriptions between supported languages
- Custom Categories: Different processing modes for various use cases (RecordBud, MeetPaper, etc.)
- Dark/Light Themes: Comfortable viewing in any lighting condition
- Internationalization: Support for multiple languages in the UI
- Responsive Design: Works perfectly on desktop, tablet, and mobile devices
- Privacy-Focused: Your API key and data stay in your browser
AI Transcriber is a modern web application designed to simplify the process of transcribing audio and video content. Whether you're creating meeting notes, transcribing interviews, or generating summaries of recorded content, AI Transcriber provides an elegant solution with its intuitive interface and powerful AI capabilities.
- Ollama Support is in the making
- Node.js (v16 or newer)
- npm (v7 or newer)
- OpenAI API key
# Clone the repository
git clone https://github.com/marvinpoo/transcriber.git
cd transcriber
# Install dependencies
npm install
# Start the development server
npm run devThe application will be available at http://localhost:5173 by default.
# Install dependencies
npm install
# Start development server with hot-reload
npm run dev
# Build for production
npm run build
# Preview production build locally
npm run preview
# Run linting
npm run lint- React 18: For building the user interface
- TypeScript: For type-safe code
- Vite: For fast development and optimized builds
- SCSS: For styling with a modular approach
- OpenAI API: For AI-powered transcription and text generation
- react-dropzone: For file upload functionality
- lucide-react: For beautiful, consistent icons
src/
βββ assets/ # Static assets like images
βββ components/ # React components
β βββ ApiKeyForm.tsx # API key input component
β βββ FileUploader.tsx # File upload component
β βββ LanguageToggle.tsx # Language selection component
β βββ OutputSelector.tsx # Output type selector
β βββ Results.tsx # Display results component
β βββ ThemeToggle.tsx # Theme switcher component
β βββ ... # Other components
βββ data/ # Application data and constants
βββ hooks/ # Custom React hooks
βββ services/ # Service classes for API interactions
βββ styles/ # CSS styles
βββ types/ # TypeScript type definitions
βββ utils/ # Utility functions
βββ App.tsx # Main application component
βββ main.tsx # Application entry point
Upload audio or video files and convert speech to text using OpenAI's powerful models. Supports various formats including MP3, WAV, M4A, MP4, MOV, and AVI.
Generate different types of outputs based on your needs:
- Standard Transcription: Clean text without filler words
- Verbatim Transcription: Includes all vocal expressions and non-verbal sounds
- Speaker Labels: Transcription formatted as a conversation with speaker identification
- Summary: Concise summary of the content
- Meeting Notes: Structured notes with action items, decisions, and deadlines
Transcribe content in multiple languages including English, German, Spanish, French, Italian, Portuguese, Russian, Japanese, Korean, and Chinese.
Optionally translate your transcriptions to a different language, making it easy to work with multilingual content.
AI Transcriber supports all modern browsers including:
- Chrome (latest)
- Firefox (latest)
- Safari (latest)
- Edge (latest)
The application is configured for easy deployment on any static hosting service.
# Build the application
npm run build
# The build output will be in the 'dist' directory
# which can be deployed to any static hosting service- API Key Issues: Ensure your OpenAI API key is valid and has sufficient credits
- File Format Problems: Check that your audio/video file is in a supported format
- Transcription Errors: For better results, use clear audio with minimal background noise
Contributions are welcome! Please check out our Contribution Guidelines for details on how to get started.
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add some amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
This project is licensed under the GPL-3.0 License - see the LICENSE file for details.
AI Transcriber is designed with privacy in mind. Your OpenAI API key and files are processed locally in your browser and are not stored on any server. For more information, see our Privacy Policy.
If you discover a security vulnerability, please follow our Security Policy for responsible disclosure.
Made with β€οΈ by marvinpoo