feat: Add audiobook generation feature, including new database files, image assets, and README updates.

2026-03-11 17:00:24 +08:00
parent 14def62d3b
commit 4408b836e1
5 changed files with 22 additions and 0 deletions

@@ -13,6 +13,7 @@
 - Custom Voice: Predefined speaker voices
 - Voice Design: Create voices from natural language descriptions
 - Voice Cloning: Clone voices from uploaded audio
+- Audiobook Generation: Upload EPUB files and generate multi-character audiobooks with LLM-powered character extraction and voice assignment
 - Dual Backend Support: Switch between local model and Aliyun TTS API
 - Multi-language Support: English, 简体中文, 繁體中文, 日本語, 한국어
 - JWT auth, async tasks, voice cache, dark mode
@@ -33,6 +34,16 @@
 </tr>
 </table>
+### Audiobook Generation
+![Audiobook Overview](./images/audiobook-overview.png)
+<table>
+<tr>
+<td width="50%"><img src="./images/audiobook-characters.png" alt="Audiobook Characters" /></td>
+<td width="50%"><img src="./images/audiobook-chapters.png" alt="Audiobook Chapters" /></td>
+</tr>
+</table>
 ## Tech Stack
 **Backend**: FastAPI + SQLAlchemy + PyTorch + JWT