Design AI voices from text descriptions or clone your own voice from a 30-second recording, then use them across unlimited videos.
Every brand has a voice. Some sound corporate. Others sound friendly. A few sound unforgettable.
The Voices page is your voice laboratory, where you design AI voices from scratch or clone your own voice to use across every video GEN creates.
💡 Did you know?
71% of GEN users reduced manual editing time by 80% or more, and custom voices are a big reason why. No more recording the same script 20 different ways.
Where Your Voice Library Lives
Click Voices in the left navigation to access your voice laboratory. Here you'll see all your created voices, their language settings, and creation dates. Each voice becomes available instantly in any Speech from Text creation card across all your projects.
Think of this as your voice casting department, except every "actor" is perfectly on-brand, available 24/7, and never needs a coffee break.
Design a Voice with AI
Maybe you need a voice that sounds "warm but authoritative" or "energetic Gen-Z." You don't need to find the perfect voice actor; you can design one.
1. Start the Design Process
From your Voices page, click Create Voice, then select Design a Voice.
2. Configure Voice Attributes
Fill out the voice profile:
Name: Something memorable like "Brand Spokesperson" or "Product Demo Voice"
Language: English, Spanish, or French
Gender: Male, female, or non-binary options
Description: The magic happens here. Describe the personality, tone, and energy you want
Your description might be: "Confident, slightly playful, like a smart friend explaining something cool. Think tech reviewer meets lifestyle influencer."
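If you plan voices in a spreadsheet or script before entering them in the form, a quick check like the one below can catch missing fields early. This is only a sketch: the class and function names are hypothetical and are not part of any GEN API, and the allowed values are simply the options listed above.

```python
from dataclasses import dataclass

# Options copied from the form fields described in this guide.
# VoiceProfile and profile_problems are hypothetical helper names.
SUPPORTED_LANGUAGES = {"English", "Spanish", "French"}
GENDER_OPTIONS = {"male", "female", "non-binary"}


@dataclass
class VoiceProfile:
    name: str
    language: str
    gender: str
    description: str


def profile_problems(profile: VoiceProfile) -> list[str]:
    """Return a list of problems; an empty list means the profile is ready to enter."""
    problems = []
    if not profile.name.strip():
        problems.append("name is empty")
    if profile.language not in SUPPORTED_LANGUAGES:
        problems.append(f"unsupported language: {profile.language}")
    if profile.gender.lower() not in GENDER_OPTIONS:
        problems.append(f"unknown gender option: {profile.gender}")
    if not profile.description.strip():
        problems.append("description is empty")
    return problems
```

Calling `profile_problems(VoiceProfile("Brand Spokesperson", "English", "female", "Confident, slightly playful"))` returns an empty list, which means every field is filled in with a listed option.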
3. Generate and Test Samples
Add Sample Text: a sentence or two your voice will speak during generation. This lets you hear how it handles your actual content style.
Click Generate and GEN will create multiple AI voice samples matching your description. Listen to each option and select the one that nails your brand vibe.
⚡ Pro Move: Test your designed voice with the actual copy you'll use most often: product descriptions, calls-to-action, or trending hooks. A voice that sounds great reading generic text might not work for your specific content style.
4. Save Your Voice
Once you've selected your favorite sample, click Save. Your new voice appears in your library and becomes immediately available in all Speech from Text creation cards.
"I thought it would feel fake. It doesn't. My audience thinks I hired a team."
- Founder, Supplement Brand
Clone Your Voice
Want every video to sound exactly like you? Voice cloning creates a digital version of your voice that can speak any script while maintaining your natural cadence, accent, and personality.
1. Prepare Your Audio Sample
Record yourself speaking for at least 30 seconds in a quiet environment. Read naturally; this isn't a robot training exercise. The better your source audio, the better your clone.
Pro recording tips:
Use a decent microphone (even phone earbuds work)
Record in a quiet room with minimal echo
Speak at your normal pace and energy
Include varied sentence structures and emotions
Save as MP3 format
2. Start Voice Training
From your Voices page, click Create Voice, then select Clone Your Voice.
3. Upload and Configure
Fill out the same basic information as voice design (name, language, gender, description), then upload your MP3 file. The system needs at least 30 seconds of audio, but longer samples (up to 2-3 minutes) often produce better results.
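If you record several takes, it can help to sanity-check each one against the requirements above before uploading. The sketch below assumes you already know the clip length in seconds (for example, from your recording app). The function name and thresholds are illustrative: 30 seconds is the stated minimum, and 180 seconds is just the upper end of the 2-3 minute suggestion, not a documented hard limit.

```python
from pathlib import Path

MIN_SECONDS = 30             # minimum stated in this guide
SUGGESTED_MAX_SECONDS = 180  # upper end of the "2-3 minutes" suggestion (not a hard limit)


def check_sample(filename: str, duration_seconds: float) -> list[str]:
    """Return notes about a recording; an empty list means it matches the guidance."""
    notes = []
    # The guide asks for MP3, so flag any other file extension.
    if Path(filename).suffix.lower() != ".mp3":
        notes.append("file is not an .mp3")
    if duration_seconds < MIN_SECONDS:
        missing = MIN_SECONDS - duration_seconds
        notes.append(f"too short: need {missing:.0f} more seconds")
    elif duration_seconds > SUGGESTED_MAX_SECONDS:
        notes.append("longer than the suggested 3 minutes; consider trimming")
    return notes
```

For example, `check_sample("founder_take2.mp3", 95)` returns an empty list, while a 20-second WAV file would be flagged on both format and length.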
4. Train Your Voice Clone
Click Train Voice to start the AI training process. This typically takes 5-15 minutes depending on your audio length and current system load.
You'll get a notification when training completes. The system analyzes your vocal patterns, accent, speaking rhythm, and tonal qualities to create a digital version that can speak any text in your voice.
How a DTC skincare brand uses this
The founder cloned her voice once, then used it across 52 videos in her first month. Her audience couldn't tell the difference, but she went from spending 3 hours a week recording to zero. "I sound like myself in every video, but I never have to speak."
Using Voices in Your Projects
Once you've created voices, they become available in every Speech from Text creation card across all your projects. Here's how the workflow connects:
1. Access Voices in Creation Cards
In any vidsheet, when you select a cell and choose Speech from Text from the creation card menu, you'll see a Voice dropdown containing all your saved voices, both designed and cloned.
2. Configure Your Speech
Choose your voice, add your script (or reference a variable from another column), and set timing preferences. Your custom voice will speak whatever text you provide, whether it's a static script or dynamically generated from AI text columns.
3. Generate and Use
Click Generate and your voice speaks the script. The audio file automatically appears in your timeline, perfectly timed and ready to layer with visuals.
💡 Did you know?
82% of GEN users said their audience couldn't tell the content was AI-generated, largely because they use consistent, on-brand voices that sound natural.
Managing Multiple Voices for Different Content
Smart brands don't use one voice for everything. You might need different voices for different content types, products, or audience segments.
Strategic Voice Planning
Main Brand Voice: Your primary spokesperson voice (often the founder's clone)
Product Demo Voice: Clear, instructional tone for tutorials
Trending Content Voice: Younger, more energetic for viral-style content
Testimonial Voice: Authentic, conversational for customer stories
Announcement Voice: Professional, authoritative for news and updates
Voice Organization Tips
Name your voices descriptively so you can quickly select the right one:
"Sarah - Main Brand" instead of just "Sarah"
"Demo Voice - Clear" instead of "Voice 2"
"Trending - Energetic" instead of "New Voice"
⚡ Pro Move: Clone your voice multiple times with different emotional instructions in the description. Same voice, different energy levels. "Sarah - Calm" vs "Sarah - Excited" vs "Sarah - Urgent" gives you three voices from one recording session.
Voice Testing and Iteration
Don't settle for your first attempt. Create multiple versions, test them in actual videos, and see which ones your audience responds to best. You can always create new voices or refine existing ones based on performance.
💡 Did you know?
A supplement brand founder posted 31 videos in 21 days using voice cloning and saw an 18% increase in TikTok Shop revenue. The secret? Every video sounded like a personal recommendation from the founder, but required zero recording time.
What's Next
With your voice library established, you're ready to create speech-driven content at scale. Your voices work seamlessly with GEN's other creation tools. Combine them with AI-generated scripts, variables, and automated posting for a complete content engine.
What brands like yours are doing:
E-commerce brands: Clone founder voice for authentic product recommendations
Beauty brands: Multiple voices for tutorials, testimonials, and trending content
Fitness brands: Motivational voice for workout content and tips
Faceless YouTube creators: AI visuals + voice clone + auto-captions for Shorts and long-form
Keep Learning
Create Talking Avatar Videos: use your voice with a lip-synced avatar
Speech from Text Reference: full input details and language support
Create Your First Video: put your voice to work in a project
Ready to start creating?
Browse our library of ready-made templates and launch your first video in minutes.
