Kokoro TTS

Kokoro TTS is an open-source text-to-speech model and inference pipeline that generates high-quality natural-sounding speech from text input. The system features the Kokoro-82M model, an 82 million parameter TTS architecture designed for efficient and expressive speech synthesis. Kokoro TTS provides clear, human-like audio output suitable for applications in voice assistants, audiobook generation, accessibility tools, and multimedia content creation. The Gradio-based interface allows users to input text and listen to generated speech in real-time, with options to adjust voice characteristics and audio quality.

Reader rating

No ratings yet

Visit website

Related tools

View all

Synthesia

No ratings yet

Synthesia is an AI video generation platform for creating presenter-led business videos without cameras, studios, or traditional editing. Users can turn scripts, documents, or training material into polished videos with AI avatars, voiceovers, templates, localization, and brand controls. It is commonly used for employee training, onboarding, sales enablement, product explainers, customer education, and internal communications. The platform fits learning teams, marketing departments, operations leaders, and global companies that need consistent video content in multiple languages. Synthesia is distinctive because it focuses on enterprise-ready avatar video production, making repeatable training and communication videos faster to produce while keeping style, messaging, and localization under control.

Pictory.AI

No ratings yet

Pictory.ai is an AI-powered platform that enables users to create professional-quality videos from text, URLs, or long-form content. It offers features like automatic captioning, realistic AI voiceovers, and access to a vast library of royalty-free visuals and music. Designed for ease of use, Pictory requires no prior video editing experience, making it suitable for content creators, marketers, and educators

Palabra.ai

No ratings yet

Palabra.ai is a real-time voice AI translator that provides speech-to-speech translation in under one second across 60+ languages. The platform supports live calls, events, streams, and meetings with voice cloning capabilities, making it 9.3x cheaper than hiring a human interpreter. Palabra.ai is aimed at international businesses, event organizers, customer support teams, and content creators who need instant multilingual communication without language barriers. The platform has translated over 500,000 minutes for enterprise clients including DHL, UNICEF, Paramount, Hyundai, BCG, Deloitte, Fujitsu, and eToro. It was named #1 Product of the Day and #1 Product of the Week on Product Hunt, and has raised $8.4M. What makes Palabra.ai stand out is the combination of sub-second latency, voice cloning, enterprise-grade reliability, and broad language coverage that makes real-time translation practical for production use rather than demo-only scenarios.