Commit Graph

67 Commits

Author SHA1 Message Date
6eb521dee4 feat: add synopsis generation endpoint and frontend integration 2026-03-13 11:42:30 +08:00
35bf7a302a feat: Implement AI script generation for audiobook projects 2026-03-13 11:29:56 +08:00
eca99da084 feat: improve character preview generation and adjust emotion handling guidelines in LLM service 2026-03-13 00:46:49 +08:00
c2e1ee0289 feat: enhance project audio directory management by clearing segments and chapters during analysis and identification 2026-03-12 23:57:13 +08:00
29799a8c7d feat: refactor voice bootstrap logic and improve error handling in audio generation 2026-03-12 23:47:52 +08:00
233c4a9a98 feat: enhance emotion handling in chapter segment parsing with clearer guidelines and intensity recommendations 2026-03-12 23:38:44 +08:00
fa54208b96 feat: update button sizes and styles for improved UI consistency in Audiobook and button components 2026-03-12 18:33:44 +08:00
afb6830a6d feat: add force option to project generation for reprocessing completed segments 2026-03-12 17:19:37 +08:00
7f25dd09f6 feat: add admin usage statistics and LLM configuration management 2026-03-12 16:30:24 +08:00
bb6ad9b0a3 feat: Implement segment update and regeneration features in Audiobook API and frontend 2026-03-12 15:48:35 +08:00
a1ee476e0f feat: Enhance narrator description and instructions in LLMService and Audiobook components 2026-03-12 15:42:53 +08:00
475df0c9ca Refactor code structure for improved readability and maintainability 2026-03-12 15:37:08 +08:00
bbcfc0e8d3 feat: Add emotion handling to audiobook segments with emo_text and emo_alpha attributes 2026-03-12 14:34:20 +08:00
c79ffac6d9 fix: Enhance emotion vector calculation in IndexTTS2Backend with emo_alpha adjustment 2026-03-12 13:50:21 +08:00
8aec4f6f44 feat: Integrate IndexTTS2 model and update related schemas and frontend components 2026-03-12 13:30:53 +08:00
29bd45e0e0 fix: Remove enable_thinking parameter from stream_chat methods 2026-03-11 19:09:17 +08:00
4f0d9f5ed6 fix: Adjust chunk size in parse_one_chapter to 1500 and add enable_thinking parameter to LLMService methods 2026-03-11 19:05:03 +08:00
b6d4d2d5f2 feat: Enhance stream_chat methods to accept max_tokens parameter for improved token management 2026-03-11 18:47:22 +08:00
f9a0e2bcc4 refactor: Simplify SQLite checks by introducing a variable for database type 2026-03-11 18:04:16 +08:00
d9082b12a8 feat: Validate LLM configuration by sending a test request during API key update. 2026-03-11 17:32:54 +08:00
14def62d3b feat: introduce new feature with database persistence and refine cancel event resolution logic. 2026-03-11 16:50:52 +08:00
0d8756ebab feat: Implement generation cancellation for projects, update project status handling, and mark chapters as done upon segment completion. 2026-03-11 16:37:33 +08:00
44c39f1456 Removed direct instantiation of ProgressStore in audiobook service and added new feature documentation. 2026-03-11 16:30:51 +08:00
ffd3d6675d feat: Implement gender-specific TTS instructions, refactor async database session handling for character creation and preview generation, and add Aliyun voice design creation. 2026-03-11 15:58:14 +08:00
d3c6297a09 feat: Implement character voice preview playback and regeneration, and add a turbo mode status indicator for audiobook projects. 2026-03-11 15:36:43 +08:00
5dded459fc feat: Implement startup logic to reset stale audiobook chapter parsing and segment generation statuses to pending. 2026-03-11 14:42:00 +08:00
264b511228 feat: Implement functionality to retry only failed audiobook chapters and refine UI for batch operations. 2026-03-11 14:37:41 +08:00
d96089a2aa feat: Automatically delete associated source files when an audiobook project is removed. 2026-03-11 14:28:11 +08:00
b7b6f5ef8e feat: Implement batch cancellation for audiobook processing with enhanced frontend progress display. 2026-03-11 14:22:35 +08:00
a0047d5c29 feat: Add batch processing for audiobook chapters including parse, generate, and combined process actions. 2026-03-11 14:08:09 +08:00
2e005b0084 feat(audiobook): add gender field to audiobook character model and update related functionality 2026-03-10 20:23:03 +08:00
1db41b6278 feat(audiobook): enhance chapter expansion functionality in ProjectCard component 2026-03-10 18:05:31 +08:00
bf7c73e57c feat(audiobook): change audio format from MP3 to WAV for project downloads and merging 2026-03-10 17:56:46 +08:00
006aa0c85f feat(audiobook): add turbo mode for project analysis and enhance log streaming with chapter support 2026-03-10 17:01:50 +08:00
3c30afc476 feat(audiobook): implement chapter management with CRUD operations and enhance project detail responses 2026-03-10 16:42:32 +08:00
01b6f4633e feat(audiobook): implement log streaming for project status updates and enhance progress tracking 2026-03-10 16:27:01 +08:00
230274bbc3 feat(audiobook): refactor background tasks to use asyncio for project analysis and generation 2026-03-10 16:13:35 +08:00
5037857dd4 Refactor audiobook service to extract chapters from EPUB files, implement chapter chunking, and enhance project analysis and generation flow 2026-03-09 19:04:13 +08:00
a68a343536 feat(llm_service): enhance chat_json error handling and improve character extraction prompt 2026-03-09 12:42:03 +08:00
6fec2eb937 feat(audiobook): implement character voice bootstrapping and enhance polling during project status transitions 2026-03-09 12:39:02 +08:00
e1dbb79564 refactor(tts_service): simplify audio data handling in LocalTTSBackend 2026-03-09 11:53:16 +08:00
9b6691bffe feat(audiobook): add endpoint to retrieve audio for a specific segment 2026-03-09 11:48:47 +08:00
a3d7d318e0 feat(audiobook): implement audiobook project management features 2026-03-09 11:39:36 +08:00
28218e6616 feat: update requirements.txt to include additional dependencies for torch, numpy, pydub, and requests 2026-03-09 10:43:46 +08:00
c880fb8949 feat: add Aliyun region configuration to .env.example 2026-03-06 16:33:24 +08:00
4081fe3754 feat: remove Aliyun region configuration from .env.example 2026-03-06 16:21:36 +08:00
38e00fd38c feat: add Docker deployment support and fix /users/me endpoint
- Add docker/ directory with Dockerfile for backend and frontend
- Backend: pytorch/pytorch CUDA base image with all qwen_tts deps
- Frontend: multi-stage nginx build with /api/ proxy to backend
- docker-compose.yml (CPU) + docker-compose.gpu.yml (GPU overlay)
- Fix /users/me returning 404 due to missing route (was caught by /{user_id})
- Update .gitignore to exclude docker/models, docker/data, docker/.env
- Update README and README.zh.md with Docker deployment instructions

Images: bdim404/qwen3-tts-backend, bdim404/qwen3-tts-frontend

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-06 15:15:27 +08:00
964ebb824c feat: Add voice management functionality with delete capability and UI integration 2026-03-06 14:35:59 +08:00
ad90e5f96c feat: Implement prepare-and-create endpoint for voice design creation and update related API and frontend logic 2026-03-06 14:23:15 +08:00
5e1e3e0668 refactor: Consolidate cache file loading logic and enhance cache saving for different data types 2026-03-06 14:07:22 +08:00