Canto

bdim/Canto

Author	SHA1	Message	Date
bdim404	475df0c9ca	Refactor code structure for improved readability and maintainability	2026-03-12 15:37:08 +08:00
bdim404	bbcfc0e8d3	feat: Add emotion handling to audiobook segments with emo_text and emo_alpha attributes	2026-03-12 14:34:20 +08:00
bdim404	c79ffac6d9	fix: Enhance emotion vector calculation in IndexTTS2Backend with emo_alpha adjustment	2026-03-12 13:50:21 +08:00
bdim404	8aec4f6f44	feat: Integrate IndexTTS2 model and update related schemas and frontend components	2026-03-12 13:30:53 +08:00
bdim404	29bd45e0e0	fix: Remove enable_thinking parameter from stream_chat methods	2026-03-11 19:09:17 +08:00
bdim404	4f0d9f5ed6	fix: Adjust chunk size in parse_one_chapter to 1500 and add enable_thinking parameter to LLMService methods	2026-03-11 19:05:03 +08:00
bdim404	b6d4d2d5f2	feat: Enhance stream_chat methods to accept max_tokens parameter for improved token management	2026-03-11 18:47:22 +08:00
bdim404	f9a0e2bcc4	refactor: Simplify SQLite checks by introducing a variable for database type	2026-03-11 18:04:16 +08:00
bdim404	d9082b12a8	feat: Validate LLM configuration by sending a test request during API key update.	2026-03-11 17:32:54 +08:00
bdim404	14def62d3b	feat: introduce new feature with database persistence and refine cancel event resolution logic.	2026-03-11 16:50:52 +08:00
bdim404	0d8756ebab	feat: Implement generation cancellation for projects, update project status handling, and mark chapters as done upon segment completion.	2026-03-11 16:37:33 +08:00
bdim404	44c39f1456	Removed direct instantiation of ProgressStore in audiobook service and added new feature documentation.	2026-03-11 16:30:51 +08:00
bdim404	ffd3d6675d	feat: Implement gender-specific TTS instructions, refactor async database session handling for character creation and preview generation, and add Aliyun voice design creation.	2026-03-11 15:58:14 +08:00
bdim404	d3c6297a09	feat: Implement character voice preview playback and regeneration, and add a turbo mode status indicator for audiobook projects.	2026-03-11 15:36:43 +08:00
bdim404	5dded459fc	feat: Implement startup logic to reset stale audiobook chapter parsing and segment generation statuses to pending.	2026-03-11 14:42:00 +08:00
bdim404	264b511228	feat: Implement functionality to retry only failed audiobook chapters and refine UI for batch operations.	2026-03-11 14:37:41 +08:00
bdim404	d96089a2aa	feat: Automatically delete associated source files when an audiobook project is removed.	2026-03-11 14:28:11 +08:00
bdim404	b7b6f5ef8e	feat: Implement batch cancellation for audiobook processing with enhanced frontend progress display.	2026-03-11 14:22:35 +08:00
bdim404	a0047d5c29	feat: Add batch processing for audiobook chapters including parse, generate, and combined process actions.	2026-03-11 14:08:09 +08:00
bdim404	2e005b0084	feat(audiobook): add gender field to audiobook character model and update related functionality	2026-03-10 20:23:03 +08:00
bdim404	1db41b6278	feat(audiobook): enhance chapter expansion functionality in ProjectCard component	2026-03-10 18:05:31 +08:00
bdim404	bf7c73e57c	feat(audiobook): change audio format from MP3 to WAV for project downloads and merging	2026-03-10 17:56:46 +08:00
bdim404	006aa0c85f	feat(audiobook): add turbo mode for project analysis and enhance log streaming with chapter support	2026-03-10 17:01:50 +08:00
bdim404	3c30afc476	feat(audiobook): implement chapter management with CRUD operations and enhance project detail responses	2026-03-10 16:42:32 +08:00
bdim404	01b6f4633e	feat(audiobook): implement log streaming for project status updates and enhance progress tracking	2026-03-10 16:27:01 +08:00
bdim404	230274bbc3	feat(audiobook): refactor background tasks to use asyncio for project analysis and generation	2026-03-10 16:13:35 +08:00
bdim404	5037857dd4	Refactor audiobook service to extract chapters from EPUB files, implement chapter chunking, and enhance project analysis and generation flow	2026-03-09 19:04:13 +08:00
bdim404	a68a343536	feat(llm_service): enhance chat_json error handling and improve character extraction prompt	2026-03-09 12:42:03 +08:00
bdim404	6fec2eb937	feat(audiobook): implement character voice bootstrapping and enhance polling during project status transitions	2026-03-09 12:39:02 +08:00
bdim404	e1dbb79564	refactor(tts_service): simplify audio data handling in LocalTTSBackend	2026-03-09 11:53:16 +08:00
bdim404	9b6691bffe	feat(audiobook): add endpoint to retrieve audio for a specific segment	2026-03-09 11:48:47 +08:00
bdim404	a3d7d318e0	feat(audiobook): implement audiobook project management features	2026-03-09 11:39:36 +08:00
bdim404	28218e6616	feat: update requirements.txt to include additional dependencies for torch, numpy, pydub, and requests	2026-03-09 10:43:46 +08:00
bdim404	c880fb8949	feat: add Aliyun region configuration to .env.example	2026-03-06 16:33:24 +08:00
bdim404	4081fe3754	feat: remove Aliyun region configuration from .env.example	2026-03-06 16:21:36 +08:00
bdim404	38e00fd38c	feat: add Docker deployment support and fix /users/me endpoint - Add docker/ directory with Dockerfile for backend and frontend - Backend: pytorch/pytorch CUDA base image with all qwen_tts deps - Frontend: multi-stage nginx build with /api/ proxy to backend - docker-compose.yml (CPU) + docker-compose.gpu.yml (GPU overlay) - Fix /users/me returning 404 due to missing route (was caught by /{user_id}) - Update .gitignore to exclude docker/models, docker/data, docker/.env - Update README and README.zh.md with Docker deployment instructions Images: bdim404/qwen3-tts-backend, bdim404/qwen3-tts-frontend Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 15:15:27 +08:00
bdim404	964ebb824c	feat: Add voice management functionality with delete capability and UI integration	2026-03-06 14:35:59 +08:00
bdim404	ad90e5f96c	feat: Implement prepare-and-create endpoint for voice design creation and update related API and frontend logic	2026-03-06 14:23:15 +08:00
bdim404	5e1e3e0668	refactor: Consolidate cache file loading logic and enhance cache saving for different data types	2026-03-06 14:07:22 +08:00
bdim404	a93754f449	feat: Enhance API interactions and improve job handling with new request validation and error management	2026-03-06 12:03:41 +08:00
bdim404	3844e825cd	fix: Update repository clone URL and adjust huggingface-cli commands in README files	2026-02-13 11:57:00 +08:00
bdim404	4f535b20e5	refactor: Remove cache and metrics endpoints, and clean up voice design CRUD operations	2026-02-04 19:41:08 +08:00
bdim404	73aacd174a	Remove file logging handler	2026-02-04 19:05:51 +08:00
bdim404	9e5d12c9fb	feat: Add voice design support for voice cloning and enhance cache management	2026-02-04 17:52:24 +08:00
bdim404	ddaa0abfc7	feat: Implement voice design management with CRUD operations and integrate into frontend	2026-02-04 13:57:20 +08:00
bdim404	6c25dd9dd9	feat: Add systemd service, configure API for proxy deployment, and enhance mobile audio playback with token authentication.	2026-02-03 21:53:41 +08:00
bdim404	18fd1f0fa5	feat: Integrate Aliyun TTS backend with dynamic speaker validation and listing, and adjust toast notification position. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-03 19:09:10 +08:00
bdim404	47e1411390	Update Local Permission Assignments	2026-02-03 18:48:25 +08:00
bdim404	d8a9f277be	feat: improve accessibility and mobile responsiveness across UI components Enhanced dialog accessibility by adding descriptive text to all dialog components. Implemented responsive layouts for mobile devices including card-based user table, adaptive navigation bar, and improved dialog spacing. Fixed Aliyun TTS health check to use WebSocket-based connectivity testing. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-03 18:03:38 +08:00
bdim404	244ff94c6a	feat: enhance audio processing and error handling in TTS backend; refactor user dialog form validation	2026-02-03 17:37:14 +08:00

1 2

56 Commits