@elizaos/plugin-tts
A plugin for text-to-speech (TTS) generation using the FAL.ai API within the ElizaOS ecosystem.
Description
The text-to-speech (TTS) plugin generates speech audio through FAL.ai's services. It converts input text to audio, automatically detects the language of the text, and selects a matching voice model.
Installation
bun install @elizaos/plugin-tts
Configuration
The plugin requires the following value, provided either as an environment variable or as a runtime setting:
FAL_API_KEY=<Your FAL.ai API key>
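The lookup described above can be sketched roughly as follows. This is a minimal illustration, not the plugin's own code: the helper name `resolveFalApiKey`, the `SettingLookup` type, and the choice to prefer the runtime setting over the environment variable are all assumptions made for the example.

```typescript
// Hypothetical helper; the names and the precedence order are assumptions,
// not part of the plugin's documented API.
type SettingLookup = (key: string) => string | null | undefined;

function resolveFalApiKey(
  getSetting: SettingLookup,
  env: Record<string, string | undefined>,
): string {
  // Try the runtime setting first, then the environment. Empty strings
  // count as "not set" because `||` skips falsy values.
  const key = (getSetting("FAL_API_KEY") || env["FAL_API_KEY"] || "").trim();
  if (key === "") {
    throw new Error(
      "FAL_API_KEY is not set; add it to the environment or the runtime settings",
    );
  }
  return key;
}
```

In a Node or Bun process, the second argument would typically be `process.env`, and the first would wrap whatever settings accessor the host runtime exposes. Failing early with a clear message tends to be easier to debug than a rejected request later on.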
Usage
Basic Integration
import { TTSGenerationPlugin } from '@elizaos/plugin-tts';
Voice Generation Examples
// The plugin responds to natural language commands like:
'Generate TTS of Hello World';
'Create a TTS for Welcome to ElizaOS';
'Make a TTS saying [your text]';
API Reference
Actions
GENERATE_TTS
Generates speech audio based on text input.
Aliases:
- TTS_GENERATION
- CREATE_TTS
- TEXT2SPEECH
- T2S
- TEXT_TO_SPEECH
- AUDIO_CREATE
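To illustrate how the aliases relate to the primary action name, here is a small sketch that maps any listed alias back to `GENERATE_TTS`. The function `resolveTtsAction` and the case-insensitive matching are assumptions for illustration; the framework's actual matching logic may differ.

```typescript
// Alias list copied from the section above. The helper itself is hypothetical.
const GENERATE_TTS = "GENERATE_TTS";
const TTS_ALIASES = new Set<string>([
  "TTS_GENERATION",
  "CREATE_TTS",
  "TEXT2SPEECH",
  "T2S",
  "TEXT_TO_SPEECH",
  "AUDIO_CREATE",
]);

// Returns the canonical action name, or undefined for unrelated names.
function resolveTtsAction(name: string): string | undefined {
  const normalized = name.trim().toUpperCase(); // assumption: case-insensitive
  if (normalized === GENERATE_TTS || TTS_ALIASES.has(normalized)) {
    return GENERATE_TTS;
  }
  return undefined;
}
```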
Features:
- Automatic language detection
- Voice selection based on detected language
- Local file caching
- Progress tracking
- Error handling
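As a rough illustration of voice selection based on the detected language, the sketch below maps a language code to a voice with a fallback. The voice identifiers, the `VOICES` table, and the `selectVoice` helper are placeholders invented for this example; the plugin's real voice table and detection step are not shown here.

```typescript
// Hypothetical voice table keyed by ISO 639-1 language code.
// The voice IDs are placeholders, not real FAL.ai voice names.
const VOICES = new Map<string, string>([
  ["en", "example-voice-en"],
  ["es", "example-voice-es"],
  ["ja", "example-voice-ja"],
]);
const DEFAULT_VOICE = "example-voice-en"; // assumption: fall back to English

// Picks a voice for a detected language tag such as "es" or "en-US".
function selectVoice(detectedLanguage: string | undefined): string {
  // Keep only the primary subtag so "en-US" and "en_GB" both map to "en".
  const [primary = ""] = (detectedLanguage ?? "")
    .trim()
    .toLowerCase()
    .split(/[-_]/);
  return VOICES.get(primary) ?? DEFAULT_VOICE;
}
```

Falling back to a default voice when the language is unknown or detection fails keeps generation working, at the cost of a possibly mismatched accent.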
Common Issues & Troubleshooting
- Generation Failures
  - Verify FAL API key is correctly set
  - Ensure text input is at least 3 characters long
  - Check network connectivity to FAL.ai services
- Storage Issues
  - Verify write permissions to content_cache directory
  - Ensure sufficient disk space
  - Check if content_cache directory exists
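Several of the checks above can be run before a request is sent. The following sketch collects them into one preflight function. It is an assumption-laden illustration: `preflight`, `PreflightInput`, and the injected `isWritableDir` probe are hypothetical names, and network and disk-space checks are left out.

```typescript
// Hypothetical preflight helper; none of these names come from the plugin.
interface PreflightInput {
  apiKey: string | undefined;
  text: string;
  cacheDir: string;
  // Injected so the sketch does not depend on a specific filesystem API.
  isWritableDir: (dir: string) => boolean;
}

const MIN_TEXT_LENGTH = 3; // minimum length stated in the troubleshooting list

// Returns a list of human-readable problems; an empty list means "ready".
function preflight(input: PreflightInput): string[] {
  const problems: string[] = [];
  if (!input.apiKey || input.apiKey.trim() === "") {
    problems.push("FAL_API_KEY is missing or empty");
  }
  if (input.text.length < MIN_TEXT_LENGTH) {
    problems.push(`text must be at least ${MIN_TEXT_LENGTH} characters long`);
  }
  if (!input.isWritableDir(input.cacheDir)) {
    problems.push(`cache directory "${input.cacheDir}" is missing or not writable`);
  }
  return problems;
}
```

In a Node or Bun environment, `isWritableDir` could be built on `fs.accessSync(dir, fs.constants.W_OK)` wrapped in a try/catch. Returning all problems at once, rather than throwing on the first, lets a caller report everything that needs fixing in a single pass.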
Security Best Practices
- API Key Management
  - Store FAL API key securely using runtime settings or environment variables
  - Never commit API keys to version control
  - Monitor API usage
Development Guide
Setting Up Development Environment
- Clone the repository
- Install dependencies:
bun install
- Build the plugin:
bun run build
- Run the plugin:
bun run dev
Future Enhancements
- Advanced Voice Features
  - Custom voice model support
  - Voice style transfer
  - Emotion control
  - Speech rate adjustment
  - Pitch modification
  - Multiple speaker support
- Audio Processing
  - Background noise reduction
  - Audio quality enhancement
  - Format conversion options
  - Volume normalization
  - Audio effects processing
  - Batch processing support
- Language Support
  - Expanded language detection
  - Regional accent support
  - Dialect customization
  - Pronunciation improvements
  - Multi-language mixing
  - Custom pronunciation rules
- Integration Features
  - Streaming audio support
  - Real-time generation
  - Caching optimization
  - Batch generation
  - Queue management
  - Progress monitoring
- Developer Tools
  - Extended API options
  - Testing framework
  - Performance profiling
  - Debug logging
  - Integration examples
  - Documentation generator
We welcome community feedback and contributions to help prioritize these enhancements.
Contributing
Contributions are welcome! Please see the CONTRIBUTING.md file for more information.
Credits
This plugin integrates with and builds upon several key technologies:
- FAL.ai: AI model deployment platform
- langdetect: Language detection library
- ElizaOS: Core framework
Special thanks to:
- The FAL.ai team for AI infrastructure
- The langdetect development community
- The Eliza community for their contributions and feedback
For more information about TTS capabilities:
License
This plugin is part of the Eliza project. See the main project repository for license information.