Canto

bdim/Canto

Author	SHA1	Message	Date
bdim404	6eb521dee4	feat: add synopsis generation endpoint and frontend integration	2026-03-13 11:42:30 +08:00
bdim404	35bf7a302a	feat: Implement AI script generation for audiobook projects	2026-03-13 11:29:56 +08:00
bdim404	eca99da084	feat: improve character preview generation and adjust emotion handling guidelines in LLM service	2026-03-13 00:46:49 +08:00
bdim404	c2e1ee0289	feat: enhance project audio directory management by clearing segments and chapters during analysis and identification	2026-03-12 23:57:13 +08:00
bdim404	29799a8c7d	feat: refactor voice bootstrap logic and improve error handling in audio generation	2026-03-12 23:47:52 +08:00
bdim404	233c4a9a98	feat: enhance emotion handling in chapter segment parsing with clearer guidelines and intensity recommendations	2026-03-12 23:38:44 +08:00
bdim404	fa54208b96	feat: update button sizes and styles for improved UI consistency in Audiobook and button components	2026-03-12 18:33:44 +08:00
bdim404	afb6830a6d	feat: add force option to project generation for reprocessing completed segments	2026-03-12 17:19:37 +08:00
bdim404	7f25dd09f6	feat: add admin usage statistics and LLM configuration management	2026-03-12 16:30:24 +08:00
bdim404	bb6ad9b0a3	feat: Implement segment update and regeneration features in Audiobook API and frontend	2026-03-12 15:48:35 +08:00
bdim404	a1ee476e0f	feat: Enhance narrator description and instructions in LLMService and Audiobook components	2026-03-12 15:42:53 +08:00
bdim404	475df0c9ca	Refactor code structure for improved readability and maintainability	2026-03-12 15:37:08 +08:00
bdim404	bbcfc0e8d3	feat: Add emotion handling to audiobook segments with emo_text and emo_alpha attributes	2026-03-12 14:34:20 +08:00
bdim404	c79ffac6d9	fix: Enhance emotion vector calculation in IndexTTS2Backend with emo_alpha adjustment	2026-03-12 13:50:21 +08:00
bdim404	8aec4f6f44	feat: Integrate IndexTTS2 model and update related schemas and frontend components	2026-03-12 13:30:53 +08:00
bdim404	29bd45e0e0	fix: Remove enable_thinking parameter from stream_chat methods	2026-03-11 19:09:17 +08:00
bdim404	4f0d9f5ed6	fix: Adjust chunk size in parse_one_chapter to 1500 and add enable_thinking parameter to LLMService methods	2026-03-11 19:05:03 +08:00
bdim404	b6d4d2d5f2	feat: Enhance stream_chat methods to accept max_tokens parameter for improved token management	2026-03-11 18:47:22 +08:00
bdim404	f9a0e2bcc4	refactor: Simplify SQLite checks by introducing a variable for database type	2026-03-11 18:04:16 +08:00
bdim404	d9082b12a8	feat: Validate LLM configuration by sending a test request during API key update.	2026-03-11 17:32:54 +08:00
bdim404	14def62d3b	feat: introduce new feature with database persistence and refine cancel event resolution logic.	2026-03-11 16:50:52 +08:00
bdim404	0d8756ebab	feat: Implement generation cancellation for projects, update project status handling, and mark chapters as done upon segment completion.	2026-03-11 16:37:33 +08:00
bdim404	44c39f1456	Removed direct instantiation of ProgressStore in audiobook service and added new feature documentation.	2026-03-11 16:30:51 +08:00
bdim404	ffd3d6675d	feat: Implement gender-specific TTS instructions, refactor async database session handling for character creation and preview generation, and add Aliyun voice design creation.	2026-03-11 15:58:14 +08:00
bdim404	d3c6297a09	feat: Implement character voice preview playback and regeneration, and add a turbo mode status indicator for audiobook projects.	2026-03-11 15:36:43 +08:00
bdim404	5dded459fc	feat: Implement startup logic to reset stale audiobook chapter parsing and segment generation statuses to pending.	2026-03-11 14:42:00 +08:00
bdim404	264b511228	feat: Implement functionality to retry only failed audiobook chapters and refine UI for batch operations.	2026-03-11 14:37:41 +08:00
bdim404	d96089a2aa	feat: Automatically delete associated source files when an audiobook project is removed.	2026-03-11 14:28:11 +08:00
bdim404	b7b6f5ef8e	feat: Implement batch cancellation for audiobook processing with enhanced frontend progress display.	2026-03-11 14:22:35 +08:00
bdim404	a0047d5c29	feat: Add batch processing for audiobook chapters including parse, generate, and combined process actions.	2026-03-11 14:08:09 +08:00
bdim404	2e005b0084	feat(audiobook): add gender field to audiobook character model and update related functionality	2026-03-10 20:23:03 +08:00
bdim404	1db41b6278	feat(audiobook): enhance chapter expansion functionality in ProjectCard component	2026-03-10 18:05:31 +08:00
bdim404	bf7c73e57c	feat(audiobook): change audio format from MP3 to WAV for project downloads and merging	2026-03-10 17:56:46 +08:00
bdim404	006aa0c85f	feat(audiobook): add turbo mode for project analysis and enhance log streaming with chapter support	2026-03-10 17:01:50 +08:00
bdim404	3c30afc476	feat(audiobook): implement chapter management with CRUD operations and enhance project detail responses	2026-03-10 16:42:32 +08:00
bdim404	01b6f4633e	feat(audiobook): implement log streaming for project status updates and enhance progress tracking	2026-03-10 16:27:01 +08:00
bdim404	230274bbc3	feat(audiobook): refactor background tasks to use asyncio for project analysis and generation	2026-03-10 16:13:35 +08:00
bdim404	5037857dd4	Refactor audiobook service to extract chapters from EPUB files, implement chapter chunking, and enhance project analysis and generation flow	2026-03-09 19:04:13 +08:00
bdim404	a68a343536	feat(llm_service): enhance chat_json error handling and improve character extraction prompt	2026-03-09 12:42:03 +08:00
bdim404	6fec2eb937	feat(audiobook): implement character voice bootstrapping and enhance polling during project status transitions	2026-03-09 12:39:02 +08:00
bdim404	e1dbb79564	refactor(tts_service): simplify audio data handling in LocalTTSBackend	2026-03-09 11:53:16 +08:00
bdim404	9b6691bffe	feat(audiobook): add endpoint to retrieve audio for a specific segment	2026-03-09 11:48:47 +08:00
bdim404	a3d7d318e0	feat(audiobook): implement audiobook project management features	2026-03-09 11:39:36 +08:00
bdim404	28218e6616	feat: update requirements.txt to include additional dependencies for torch, numpy, pydub, and requests	2026-03-09 10:43:46 +08:00
bdim404	c880fb8949	feat: add Aliyun region configuration to .env.example	2026-03-06 16:33:24 +08:00
bdim404	4081fe3754	feat: remove Aliyun region configuration from .env.example	2026-03-06 16:21:36 +08:00
bdim404	38e00fd38c	feat: add Docker deployment support and fix /users/me endpoint - Add docker/ directory with Dockerfile for backend and frontend - Backend: pytorch/pytorch CUDA base image with all qwen_tts deps - Frontend: multi-stage nginx build with /api/ proxy to backend - docker-compose.yml (CPU) + docker-compose.gpu.yml (GPU overlay) - Fix /users/me returning 404 due to missing route (was caught by /{user_id}) - Update .gitignore to exclude docker/models, docker/data, docker/.env - Update README and README.zh.md with Docker deployment instructions Images: bdim404/qwen3-tts-backend, bdim404/qwen3-tts-frontend Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 15:15:27 +08:00
bdim404	964ebb824c	feat: Add voice management functionality with delete capability and UI integration	2026-03-06 14:35:59 +08:00
bdim404	ad90e5f96c	feat: Implement prepare-and-create endpoint for voice design creation and update related API and frontend logic	2026-03-06 14:23:15 +08:00
bdim404	5e1e3e0668	refactor: Consolidate cache file loading logic and enhance cache saving for different data types	2026-03-06 14:07:22 +08:00

1 2

67 Commits