Quick Start

This guide covers every Uberduck feature:
- Getting Started — Create account and basic setup
- How to Use AI Voice Cloning — Clone any voice from a short audio sample
- How to Use Image Generator — Create AI images from text prompts
- How to Use AI Sound Generator — Generate custom sound effects on demand
- How to Use Voice Transformation — Change your voice style in real time
- How to Use Text-to-Speech — Convert written text into spoken audio
- How to Use AI Voiceover — Record professional voiceovers for any project
- How to Use AI Music Studio — Create AI rap tracks and songs from text
Time needed: 5 minutes per feature
Also in this guide: Pro Tips | Common Mistakes | Troubleshooting | Pricing | Alternatives
Why Trust This Guide
I’ve used Uberduck for over 6 months and tested every feature covered here. This how to use Uberduck tutorial comes from real hands-on experience — not marketing fluff or vendor screenshots.

Uberduck is one of the most creative AI voice tools available today.
But most users only scratch the surface of what it can do.
This guide shows you how to use every major feature.
Step by step, with screenshots and pro tips.
Uberduck Tutorial
This complete Uberduck tutorial walks you through every feature step by step, from initial setup to advanced tips that will make you a power user.

Uberduck
Turn text into speech, music, and rap using AI-powered voices and custom voice clones. Uberduck gives you access to thousands of expressive voices for any creative project. Try it free with 300 render credits.
Getting Started with Uberduck
Before using any feature, complete this one-time setup.
It takes about 3 minutes.
Now let’s walk through each step.
Step 1: Create Your Account
Go to Uberduck’s website at uberduck.ai.
Click “Sign Up” in the top right corner.
Enter your email and create a password.
✓ Checkpoint: Check your inbox for a confirmation email.
Step 2: Access the Dashboard
Uberduck is a web-based platform with no downloads needed.
Log in with your new account at app.uberduck.ai.
Here’s what the dashboard looks like:

✓ Checkpoint: You should see the main dashboard with all features listed.
Step 3: Complete Initial Setup
Select your primary use case when prompted.
Choose between content creation, music, or development.
You start with 300 free render credits per month.
✅ Done: You’re ready to use any feature below.
How to Use Uberduck AI Voice Cloning
AI Voice Cloning lets you create a custom voice model from a short audio sample.
Here’s how to use it step by step.
Watch AI Voice Cloning in action:

Now let’s break down each step.
Step 1: Open the Voice Cloning Tool
Click “Voice Cloning” from the main dashboard menu.
You will see the upload interface for audio samples.
Step 2: Upload Your Audio Sample
Record or upload a clean audio clip of the voice you want to clone.
Aim for at least 30 seconds of clear speech without background noise.
✓ Checkpoint: You should see your audio file listed in the upload area.
Step 3: Generate Your Voice Clone
Name your voice clone and click “Create Voice.
Wait a few minutes while Uberduck processes your sample.
Test it by typing any text and clicking “Generate.”
✅ Result: Your custom voice clone is ready for text-to-speech and music projects.
💡 Pro Tip: Use a quiet room and a good microphone for best results. Background noise reduces clone quality significantly.
How to Use Uberduck Image Generator
Image Generator lets you create AI-powered images from text prompts.
Here’s how to use it step by step.
Watch Image Generator in action:

Now let’s break down each step.
Step 1: Open the Image Generator
Navigate to “Image Generator” from the sidebar or dashboard.
The text prompt field will appear at the top of the page.
Step 2: Enter Your Prompt
Type a detailed description of the image you want to create.
Be specific about style, colors, and composition for better results.
✓ Checkpoint: You should see a preview of your prompt ready to generate.
Step 3: Generate and Download
Click “Generate” and wait for the AI to create your image.
Download the result or regenerate with a modified prompt.
✅ Result: Your AI-generated image is ready to use in any project.
💡 Pro Tip: Add style keywords like “photorealistic,” “cartoon,” or “watercolor” to guide the AI’s output direction.
How to Use Uberduck AI Sound Generator
AI Sound Generator lets you create custom sound effects from text descriptions.
Here’s how to use it step by step.
Watch AI Sound Generator in action:

Now let’s break down each step.
Step 1: Open the Sound Generator
Click “Sound Generator” from the main navigation menu.
You will see a text input field for describing your sound.
Step 2: Describe Your Sound Effect
Type a description like “thunder rumbling in the distance” or “doorbell chime.”
The more specific your description, the better the output quality.
✓ Checkpoint: Your sound description should appear in the input field.
Step 3: Generate and Export
Click “Generate” and preview the sound effect.
Download the audio file in your preferred format.
✅ Result: Your custom sound effect is ready to add to any audio or video project.
💡 Pro Tip: Generate multiple variations of the same sound and pick the best one. Each generation produces slightly different results.
How to Use Uberduck Voice Transformation
Voice Transformation lets you change a recorded voice’s style while keeping its cadence.
Here’s how to use it step by step.
Watch Voice Transformation in action:

Now let’s break down each step.
Step 1: Open Voice Transformation
Select “Voice to Voice” from the dashboard menu.
The upload interface will appear for your source audio.
Step 2: Upload and Select Target Voice
Upload your recorded audio file or record directly in the browser.
Choose a target voice from the voice library to transform into.
✓ Checkpoint: Both your source audio and target voice should be selected.
Step 3: Transform and Download
Click “Transform” and wait for the AI to process your audio.
Preview the result and download if satisfied.
✅ Result: Your voice has been transformed into the selected style while keeping your delivery.
💡 Pro Tip: Speak clearly and at a steady pace for the best transformation results. Rushing can cause audio artifacts.
How to Use Uberduck Text-to-Speech
Text-to-Speech lets you convert any written text into spoken audio in various voices.
Here’s how to use it step by step.
Watch Text-to-Speech in action:

Now let’s break down each step.
Step 1: Open the TTS Tool
Click “Text to Speech” from the main dashboard.
The text editor and voice selector will load on screen.
Step 2: Enter Text and Choose a Voice
Type or paste the text you want converted to speech.
Browse the voice library and select one that fits your project.
✓ Checkpoint: Your text and selected voice should both appear in the interface.
Step 3: Generate and Export Audio
Click “Generate” to create the audio file.
Preview it, then download as MP3 or WAV.
✅ Result: Your text has been converted into a natural-sounding audio file.
💡 Pro Tip: Add punctuation and line breaks to control pacing. Commas create short pauses, periods create longer ones.
How to Use Uberduck AI Voiceover
AI Voiceover lets you create professional narration for videos and podcasts.
Here’s how to use it step by step.
Watch AI Voiceover in action:
Now let’s break down each step.
Step 1: Open the Voiceover Tool
Navigate to the “AI Voiceover” section from the dashboard.
You will see options for script input and voice selection.
Step 2: Write or Paste Your Script
Enter your voiceover script in the text editor.
Select your preferred voice and adjust speed or pitch settings.
✓ Checkpoint: Your script and voice settings should be configured.
Step 3: Generate and Download
Click “Generate Voiceover” to create the audio.
Preview the result and download for use in your video or podcast.
✅ Result: Your professional AI voiceover is ready to add to any project.
💡 Pro Tip: Break long scripts into shorter sections. This gives you more control over pacing and tone throughout the narration.
How to Use Uberduck AI Music Studio
AI Music Studio lets you create AI rap tracks and songs from text input.
Here’s how to use it step by step.
Watch AI Music Studio in action:
Now let’s break down each step.
Step 1: Open the Music Studio
Click “AI Music” or “Rap Generator” from the dashboard.
The beat selector and lyrics editor will appear.
Step 2: Choose a Beat and Add Lyrics
Browse the beat library and select a backing track you like.
Type your own lyrics or let the AI generate them based on a topic.
✓ Checkpoint: Your beat and lyrics should both be loaded in the studio.
Step 3: Select Voice and Generate
Pick a voice model for your rap or song.
Click “Generate” and let Uberduck create the complete track.
Download or share the finished audio file.
✅ Result: Your AI-generated rap or song track is ready to share or download.
💡 Pro Tip: Keep lyrics to four lines per verse for the best rhythm. The AI syncs better with shorter, punchier sections.
Uberduck Pro Tips and Shortcuts
After testing Uberduck for over 6 months, here are my best tips.
Keyboard Shortcuts
| Action | Shortcut |
|---|---|
| Play/Pause Audio Preview | Spacebar |
| Generate Audio | Ctrl + Enter |
| Select All Text | Ctrl + A |
| Download Last Generation | Ctrl + D |
Hidden Features Most People Miss
- Prompt Builder: Use the built-in prompt builder to structure complex TTS requests with specific voice directions and emotions.
- API Batch Processing: If you have a Creator plan or above, use the API to generate hundreds of audio files automatically from a script list.
- Voice Stacking: Layer multiple voice generations at different pitches to create unique chorus effects for music tracks.
Uberduck Common Mistakes to Avoid
Mistake #1: Using Low-Quality Audio for Voice Cloning
❌ Wrong: Recording voice samples with background noise or a laptop microphone.
✅ Right: Use a quiet room and a decent USB microphone for clean, clear audio samples.
Mistake #2: Burning Through Credits on Test Generations
❌ Wrong: Generating full-length audio files just to test a voice or setting.
✅ Right: Use short test phrases first, then generate the full audio once you are happy with the voice.
Mistake #3: Ignoring Licensing Restrictions
❌ Wrong: Using free plan audio in commercial projects without checking the license.
✅ Right: Upgrade to the Creator plan ($5/month) for a commercial license before publishing any content.
Uberduck Troubleshooting
Problem: Audio Generation Fails or Times Out
Cause: Server load is high or your input text is too long for a single request.
Fix: Break your text into shorter sections and try generating again. Wait a few minutes during peak hours.
Problem: Voice Clone Sounds Robotic
Cause: The audio sample was too short or contained background noise.
Fix: Re-record with a better microphone in a quiet space. Aim for at least 60 seconds of clear speech.
Problem: Credits Running Out Too Fast
Cause: Each second of generated audio costs one render credit. Long generations drain credits quickly.
Fix: Use short test phrases before generating full audio. Upgrade to the Creator or Pro plan for more monthly credits.
📌 Note: If none of these fix your issue, contact Uberduck support.
What is Uberduck?
Uberduck is an AI voice tool that turns text into speech, music, and rap using synthetic voices.
Think of it like a recording studio powered entirely by artificial intelligence.
Watch this quick overview:
It includes these key features:
- AI Voice Cloning: Create custom voice models from short audio samples for any project.
- Image Generator: Generate AI-powered images from text prompts directly inside the platform.
- AI Sound Generator: Create custom sound effects by describing what you want in plain text.
- Voice Transformation: Change a recorded voice into another style while keeping the original delivery.
- Text-to-Speech: Convert written text into spoken audio across thousands of AI voices.
- AI Voiceover: Create professional narration for videos, podcasts, and social media content.
- AI Music Studio: Generate AI rap tracks, songs, and vocal performances from text and beats.
Content creators use Uberduck for video voiceovers and social media clips.
Musicians use it to generate vocal melodies and complete rap tracks.
Developers use the API to add voice features to their apps.
For a full review, see our Uberduck review.

Uberduck Pricing
Here’s what Uberduck costs in 2026:
| Plan | Price | Best For |
|---|---|---|
| Free | $0 | Testing features with 300 credits/month |
| Starter | $2/month | Personal use and exploration |
| Creator | $5/month | Content creators needing commercial license and API |
| Pro | $30/month | Businesses and high-volume creators |
| Enterprise | Custom | Large teams with custom requirements |
Free trial: Yes — 300 render credits per month on the free plan, no credit card needed.
Money-back guarantee: Contact support within the first billing cycle for refund requests.

💰 Best Value: Creator plan at $5/month — it unlocks commercial licensing, API access, and AI rap generation.
My Experience with Uberduck Pricing
I started on the free plan and hit the 300-credit limit within a week.
The Creator plan at $5/month gave me enough credits for daily use.
The commercial license alone makes it worth the upgrade.
If you are producing content regularly, the Pro plan at $30/month is the safest choice.

Uberduck vs Alternatives
How does Uberduck compare? Here’s the competitive landscape:
| Tool | Best For | Price | Rating |
|---|---|---|---|
| Uberduck | AI rap and creative vocals | $2/mo | ⭐ 4.0 |
| ElevenLabs | Ultra-realistic voice cloning | $5/mo | ⭐ 4.8 |
| Murf AI | Business voiceovers | $19/mo | ⭐ 4.3 |
| LOVO Studio | Video and voice creation | $29/mo | ⭐ 4.2 |
| Synthesys | AI avatars and voice | $20/mo | ⭐ 3.8 |
| Play.ht | Podcast and long-form audio | $31/mo | ⭐ 4.1 |
| Descript | All-in-one audio/video editing | $16/mo | ⭐ 4.5 |
Quick picks:
- Best overall: ElevenLabs — industry-leading voice quality and the largest voice library.
- Best budget: Uberduck — free plan available with paid plans starting at just $2/month.
- Best for beginners: Murf AI — clean interface with drag-and-drop voice editing.
- Best for music creation: Uberduck — the only tool with a built-in AI rap generator.
🎯 Uberduck Alternatives
Looking for Uberduck alternatives? Here are the top options:
- 🚀 ElevenLabs: The gold standard in AI voice generation with 10,000+ voices in 70+ languages and the most realistic speech quality available.
- 💰 Murf AI: Professional voiceover platform with studio-grade voices, perfect for marketing teams and corporate training content.
- 🎨 LOVO Studio: All-in-one voice and video creation platform with 500+ voices in 100+ languages and built-in video editing.
- ⚡ Synthesys: Combines AI voice generation with AI avatar videos for a complete synthetic media creation platform.
- 🔒 Play.ht: Specializes in podcast and long-form audio with 600+ voices and built-in WordPress integration for bloggers.
- 🧠 Descript: All-in-one audio and video editor with text-based editing, transcription, and AI voice features built right in.
For the full list, see our Uberduck alternatives guide.
⚔️ Uberduck Compared
Here’s how Uberduck stacks up against each competitor:
- Uberduck vs ElevenLabs: ElevenLabs wins on voice realism and language support. Uberduck wins on music generation and lower entry price.
- Uberduck vs Murf AI: Murf AI is better for corporate voiceovers. Uberduck offers more creative tools like rap generation and voice transformation.
- Uberduck vs LOVO Studio: LOVO has a more polished video editor. Uberduck stands out with its unique AI music studio and rap features.
- Uberduck vs Synthesys: Synthesys adds AI avatars for video. Uberduck is better for pure audio creation and music production.
- Uberduck vs Play.ht: Play.ht is stronger for long-form narration. Uberduck offers more creative and musical AI voice features.
- Uberduck vs Descript: Descript is an all-in-one editor for audio and video. Uberduck focuses on AI voice synthesis and music creation.
Start Using Uberduck Now
You learned how to use every major Uberduck feature:
- ✅ AI Voice Cloning
- ✅ Image Generator
- ✅ AI Sound Generator
- ✅ Voice Transformation
- ✅ Text-to-Speech
- ✅ AI Voiceover
- ✅ AI Music Studio
Next step: Pick one feature and try it now.
Most people start with Text-to-Speech.
It takes less than 5 minutes.
Frequently Asked Questions
What happened to Uberduck AI voices?
Uberduck removed most celebrity and character voices in 2023 due to legal issues and the SAG-AFTRA strike. The platform went from over 5,000 voices to a smaller curated library. You can still use the remaining TTS voices, AI vocal voices, and create your own custom voice clones.
Is Uberduck free to use?
Yes, Uberduck offers a free plan with 300 render credits per month. One credit equals roughly one second of audio. The free plan is limited to non-commercial use. For commercial projects, you need the Creator plan at $5/month or higher.
Can I use Uberduck for commercial use?
Commercial use requires the Creator plan ($5/month) or higher. The free and Starter plans are limited to non-commercial and personal use only. Always check the licensing terms for specific voices before publishing content.
How do you clone your voice on Uberduck?
Go to the Voice Cloning section, upload a clean audio sample of at least 30 seconds, name your voice, and click “Create Voice.” The AI processes your sample and creates a custom voice model you can use for text-to-speech and music projects.
How to use Uberduck AI?
Sign up for a free account at uberduck.ai. Choose a feature like Text-to-Speech or AI Rap from the dashboard. Enter your text, select a voice, and click “Generate.” Download the audio file when it is ready. Each feature follows this simple input-select-generate pattern.













