Commit Graph

91 Commits

Author SHA1 Message Date
1db41b6278 feat(audiobook): enhance chapter expansion functionality in ProjectCard component 2026-03-10 18:05:31 +08:00
bf7c73e57c feat(audiobook): change audio format from MP3 to WAV for project downloads and merging 2026-03-10 17:56:46 +08:00
006aa0c85f feat(audiobook): add turbo mode for project analysis and enhance log streaming with chapter support 2026-03-10 17:01:50 +08:00
11d44fd0be feat(audiobook): enhance LogStream component and add bulk processing for chapter tasks 2026-03-10 16:59:55 +08:00
3c30afc476 feat(audiobook): implement chapter management with CRUD operations and enhance project detail responses 2026-03-10 16:42:32 +08:00
01b6f4633e feat(audiobook): implement log streaming for project status updates and enhance progress tracking 2026-03-10 16:27:01 +08:00
230274bbc3 feat(audiobook): refactor background tasks to use asyncio for project analysis and generation 2026-03-10 16:13:35 +08:00
5037857dd4 Refactor audiobook service to extract chapters from EPUB files, implement chapter chunking, and enhance project analysis and generation flow 2026-03-09 19:04:13 +08:00
a68a343536 feat(llm_service): enhance chat_json error handling and improve character extraction prompt 2026-03-09 12:42:03 +08:00
6fec2eb937 feat(audiobook): implement character voice bootstrapping and enhance polling during project status transitions 2026-03-09 12:39:02 +08:00
109ec25246 feat(audiobook): enhance segment polling during project status changes 2026-03-09 12:03:26 +08:00
f20b250430 feat(audiobook): implement SequentialPlayer for audio segment playback 2026-03-09 12:00:03 +08:00
e1dbb79564 refactor(tts_service): simplify audio data handling in LocalTTSBackend 2026-03-09 11:53:16 +08:00
9b6691bffe feat(audiobook): add endpoint to retrieve audio for a specific segment 2026-03-09 11:48:47 +08:00
a3d7d318e0 feat(audiobook): implement audiobook project management features 2026-03-09 11:39:36 +08:00
28218e6616 feat: update requirements.txt to include additional dependencies for torch, numpy, pydub, and requests 0.0.1 2026-03-09 10:43:46 +08:00
8966dcc969 chore: remove backend URL setup instructions from README files 2026-03-06 17:55:05 +08:00
37693eb60a feat: remove frontend configuration and API section from README files 2026-03-06 17:44:50 +08:00
a7c726195c chore: sync qwen_tts from upstream QwenLM/Qwen3-TTS@main 2026-03-06 17:36:53 +08:00
2f309d7e4c chore: remove qwen_tts before subtree setup 2026-03-06 17:35:16 +08:00
dc9feaac46 feat: remove production environment variables from .env.production 2026-03-06 16:57:30 +08:00
d928087e79 feat: update docker-compose to use pre-built images for backend and frontend 2026-03-06 16:57:03 +08:00
c880fb8949 feat: add Aliyun region configuration to .env.example 2026-03-06 16:33:24 +08:00
4081fe3754 feat: remove Aliyun region configuration from .env.example 2026-03-06 16:21:36 +08:00
0ca1a9823b feat: add GitHub Actions workflows for publishing backend and frontend Docker images 2026-03-06 16:20:08 +08:00
8a9ed60add feat: add GitHub Actions workflow for Docker image publishing and update .gitignore for nginx.conf 2026-03-06 16:12:28 +08:00
38e00fd38c feat: add Docker deployment support and fix /users/me endpoint
- Add docker/ directory with Dockerfile for backend and frontend
- Backend: pytorch/pytorch CUDA base image with all qwen_tts deps
- Frontend: multi-stage nginx build with /api/ proxy to backend
- docker-compose.yml (CPU) + docker-compose.gpu.yml (GPU overlay)
- Fix /users/me returning 404 due to missing route (was caught by /{user_id})
- Update .gitignore to exclude docker/models, docker/data, docker/.env
- Update README and README.zh.md with Docker deployment instructions

Images: bdim404/qwen3-tts-backend, bdim404/qwen3-tts-frontend

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-06 15:15:27 +08:00
964ebb824c feat: Add voice management functionality with delete capability and UI integration 2026-03-06 14:35:59 +08:00
ad90e5f96c feat: Implement prepare-and-create endpoint for voice design creation and update related API and frontend logic 2026-03-06 14:23:15 +08:00
5e1e3e0668 refactor: Consolidate cache file loading logic and enhance cache saving for different data types 2026-03-06 14:07:22 +08:00
0cbf629499 refactor: Simplify README and remove outdated images; enhance Navbar with Home link 2026-03-06 13:56:03 +08:00
c35bf0ed00 style: Adjust AudioPlayer component CSS for improved layout and consistency 2026-03-06 13:30:00 +08:00
cf83811277 docs: Add warning notice about project stability in README files 2026-03-06 13:28:35 +08:00
01d7cf8fc9 style: Update Navbar and Home components with background color adjustments 2026-03-06 13:24:44 +08:00
abfd7b8f41 refactor: Remove unused text prop and related logic from AudioPlayer component 2026-03-06 12:09:09 +08:00
a93754f449 feat: Enhance API interactions and improve job handling with new request validation and error management 2026-03-06 12:03:41 +08:00
3844e825cd fix: Update repository clone URL and adjust huggingface-cli commands in README files 2026-02-13 11:57:00 +08:00
27c8925f7d refactor: Optimize font loading by removing unnecessary async/await and streamline language handling 2026-02-06 18:51:00 +08:00
8f5cfd8093 feat: Replace react-h5-audio-player with @arraypress/waveform-player and update AudioPlayer component to support waveform visualization 2026-02-06 18:50:06 +08:00
26e40039a9 feat: Refactor voice cloning job submission to use FormData and update VoiceDesignForm instruct property to be optional 2026-02-06 15:59:16 +08:00
cbf906574c feat: Update voice design tab labels for consistency across languages 2026-02-06 14:35:20 +08:00
d8488534b9 feat: Improve Textarea height adjustment and overflow handling 2026-02-06 14:32:42 +08:00
9966652542 feat: Update Navbar and HistorySidebar layout for improved navigation and styling 2026-02-06 14:28:13 +08:00
d0dcd655fd feat: Update font file references to use noto-serif-latin-regular.woff2 2026-02-06 14:14:18 +08:00
5c0111a7a2 feat: Enhance README with project description, installation instructions, and acknowledgments 2026-02-06 14:13:14 +08:00
2d2c4e9f98 feat: Add font loading functionality for multi-language support and preload base font 2026-02-06 14:09:22 +08:00
9e61734e25 refactor: Remove unused AudioLines import and update Navbar logo rendering 2026-02-06 14:06:09 +08:00
70a4d87aae style: Update HistorySidebar styles for improved layout and spacing 2026-02-06 14:00:07 +08:00
a756a31479 feat: Update README with desktop and mobile interface previews; add new images for light/dark modes, settings, and history 2026-02-05 23:51:40 +08:00
a88a31ef86 feat: Add multi-language support and interface previews to README; include new images for voice design and cloning features 2026-02-05 23:42:12 +08:00