feat: Add audiobook generation feature, including new database files, image assets, and README updates.
README.md (11 lines changed)
@@ -13,6 +13,7 @@
 - Custom Voice: Predefined speaker voices
 - Voice Design: Create voices from natural language descriptions
 - Voice Cloning: Clone voices from uploaded audio
+- Audiobook Generation: Upload EPUB files and generate multi-character audiobooks with LLM-powered character extraction and voice assignment
 - Dual Backend Support: Switch between local model and Aliyun TTS API
 - Multi-language Support: English, 简体中文, 繁體中文, 日本語, 한국어
 - JWT auth, async tasks, voice cache, dark mode
@@ -33,6 +34,16 @@
 </tr>
 </table>
 
+### Audiobook Generation
+[image]
+
+<table>
+<tr>
+<td width="50%"><img src="./images/audiobook-characters.png" alt="Audiobook Characters" /></td>
+<td width="50%"><img src="./images/audiobook-chapters.png" alt="Audiobook Chapters" /></td>
+</tr>
+</table>
+
 ## Tech Stack
 
 **Backend**: FastAPI + SQLAlchemy + PyTorch + JWT
README.zh.md (11 lines changed)
@@ -13,6 +13,7 @@
 - 自定义语音:预定义说话人语音
 - 语音设计:自然语言描述创建语音
 - 语音克隆:上传音频克隆语音
+- 有声书生成:上传 EPUB 文件,通过 LLM 自动提取角色并分配语音,生成多角色有声书
 - 双后端支持:支持本地模型和阿里云 TTS API 切换
 - 多语言支持:English、简体中文、繁體中文、日本語、한국어
 - JWT 认证、异步任务、语音缓存、暗黑模式
@@ -33,6 +34,16 @@
 </tr>
 </table>
 
+### 有声书生成
+[image]
+
+<table>
+<tr>
+<td width="50%"><img src="./images/audiobook-characters.png" alt="有声书角色列表" /></td>
+<td width="50%"><img src="./images/audiobook-chapters.png" alt="有声书章节列表" /></td>
+</tr>
+</table>
+
 ## 技术栈
 
 **后端**: FastAPI + SQLAlchemy + PyTorch + JWT
BIN images/audiobook-chapters.png (new file)
Binary file not shown. Size: 164 KiB
BIN images/audiobook-characters.png (new file)
Binary file not shown. Size: 188 KiB
BIN images/audiobook-overview.png (new file)
Binary file not shown. Size: 209 KiB