feat: Add multi-language support and interface previews to README; include new images for voice design and cloning features

This commit is contained in:
2026-02-05 23:42:12 +08:00
parent f9eaf88807
commit a88a31ef86
7 changed files with 74 additions and 0 deletions

View File

@@ -10,8 +10,45 @@ A text-to-speech web application based on Qwen3-TTS, supporting custom voice, vo
- Voice Design: Create voices from natural language descriptions
- Voice Cloning: Clone voices from uploaded audio
- Dual Backend Support: Switch between local model and Aliyun TTS API
- Multi-language Support: English, 简体中文, 繁體中文, 日本語, 한국어
- JWT auth, async tasks, voice cache, dark mode
## Interface Preview
### Light & Dark Mode
<table>
<tr>
<td width="50%">
<img src="./images/lightmode-english.png" alt="Light Mode" />
<p align="center"><em>Light Mode - Custom Voice</em></p>
</td>
<td width="50%">
<img src="./images/darkmode-chinese.png" alt="Dark Mode" />
<p align="center"><em>Dark Mode - Custom Voice</em></p>
</td>
</tr>
</table>
### Voice Design
<p align="center">
<img src="./images/custom-voice-list.png" alt="Voice Design List" width="80%" />
</p>
<p align="center"><em>Manage your custom voice designs</em></p>
<p align="center">
<img src="./images/save-voice-design-dialog.png" alt="Save Voice Design" width="60%" />
</p>
<p align="center"><em>Save voice design dialog</em></p>
### Voice Cloning
<p align="center">
<img src="./images/clone-voice-recording.png" alt="Voice Cloning" width="80%" />
</p>
<p align="center"><em>Clone voices by recording or uploading audio</em></p>
## Tech Stack
Backend: FastAPI + SQLAlchemy + PyTorch + JWT

View File

@@ -10,8 +10,45 @@
- 语音设计:自然语言描述创建语音
- 语音克隆:上传音频克隆语音
- 双后端支持:支持本地模型和阿里云 TTS API 切换
- 多语言支持English、简体中文、繁體中文、日本語、한국어
- JWT 认证、异步任务、语音缓存、暗黑模式
## 界面预览
### 亮色与暗黑模式
<table>
<tr>
<td width="50%">
<img src="./images/lightmode-english.png" alt="亮色模式" />
<p align="center"><em>亮色模式 - 自定义语音</em></p>
</td>
<td width="50%">
<img src="./images/darkmode-chinese.png" alt="暗黑模式" />
<p align="center"><em>暗黑模式 - 自定义语音</em></p>
</td>
</tr>
</table>
### 语音设计
<p align="center">
<img src="./images/custom-voice-list.png" alt="语音设计列表" width="80%" />
</p>
<p align="center"><em>管理您的自定义语音设计</em></p>
<p align="center">
<img src="./images/save-voice-design-dialog.png" alt="保存语音设计" width="60%" />
</p>
<p align="center"><em>保存语音设计对话框</em></p>
### 语音克隆
<p align="center">
<img src="./images/clone-voice-recording.png" alt="语音克隆" width="80%" />
</p>
<p align="center"><em>通过录音或上传音频克隆声音</em></p>
## 技术栈
后端FastAPI + SQLAlchemy + PyTorch + JWT

Binary file not shown.

After

Width:  |  Height:  |  Size: 164 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 210 KiB

BIN
images/darkmode-chinese.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 122 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 140 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 151 KiB