feat: Add audiobook generation feature, including new database files, image assets, and README updates.

This commit is contained in:
2026-03-11 17:00:24 +08:00
parent 14def62d3b
commit 4408b836e1
5 changed files with 22 additions and 0 deletions

View File

@@ -13,6 +13,7 @@
- Custom Voice: Predefined speaker voices - Custom Voice: Predefined speaker voices
- Voice Design: Create voices from natural language descriptions - Voice Design: Create voices from natural language descriptions
- Voice Cloning: Clone voices from uploaded audio - Voice Cloning: Clone voices from uploaded audio
- Audiobook Generation: Upload EPUB files and generate multi-character audiobooks with LLM-powered character extraction and voice assignment
- Dual Backend Support: Switch between local model and Aliyun TTS API - Dual Backend Support: Switch between local model and Aliyun TTS API
- Multi-language Support: English, 简体中文, 繁體中文, 日本語, 한국어 - Multi-language Support: English, 简体中文, 繁體中文, 日本語, 한국어
- JWT auth, async tasks, voice cache, dark mode - JWT auth, async tasks, voice cache, dark mode
@@ -33,6 +34,16 @@
</tr> </tr>
</table> </table>
### Audiobook Generation
![Audiobook Overview](./images/audiobook-overview.png)
<table>
<tr>
<td width="50%"><img src="./images/audiobook-characters.png" alt="Audiobook Characters" /></td>
<td width="50%"><img src="./images/audiobook-chapters.png" alt="Audiobook Chapters" /></td>
</tr>
</table>
## Tech Stack ## Tech Stack
**Backend**: FastAPI + SQLAlchemy + PyTorch + JWT **Backend**: FastAPI + SQLAlchemy + PyTorch + JWT

View File

@@ -13,6 +13,7 @@
- 自定义语音:预定义说话人语音 - 自定义语音:预定义说话人语音
- 语音设计:自然语言描述创建语音 - 语音设计:自然语言描述创建语音
- 语音克隆:上传音频克隆语音 - 语音克隆:上传音频克隆语音
- 有声书生成:上传 EPUB 文件,通过 LLM 自动提取角色并分配语音,生成多角色有声书
- 双后端支持:支持本地模型和阿里云 TTS API 切换 - 双后端支持:支持本地模型和阿里云 TTS API 切换
- 多语言支持English、简体中文、繁體中文、日本語、한국어 - 多语言支持English、简体中文、繁體中文、日本語、한국어
- JWT 认证、异步任务、语音缓存、暗黑模式 - JWT 认证、异步任务、语音缓存、暗黑模式
@@ -33,6 +34,16 @@
</tr> </tr>
</table> </table>
### 有声书生成
![有声书概览](./images/audiobook-overview.png)
<table>
<tr>
<td width="50%"><img src="./images/audiobook-characters.png" alt="有声书角色列表" /></td>
<td width="50%"><img src="./images/audiobook-chapters.png" alt="有声书章节列表" /></td>
</tr>
</table>
## 技术栈 ## 技术栈
**后端**: FastAPI + SQLAlchemy + PyTorch + JWT **后端**: FastAPI + SQLAlchemy + PyTorch + JWT

Binary file not shown.

After

Width:  |  Height:  |  Size: 164 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 188 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 209 KiB