live-voice-chat

8 Commits

Include renames

Author	SHA1	Message	Date
bhetherman	44a10667c2	Enhance video handling and performance optimizations - Added environment variables to prevent CPU thread pools from busy-waiting. - Deferred loading of video models until first use to reduce VRAM footprint. - Implemented streaming of speaking clips for improved responsiveness. - Introduced a queue for managing speaking clips to handle multiple requests smoothly. - Updated video playback logic to ensure proper handling of clip generation.	2026-04-24 00:36:18 -04:00
bhetherman	129df7d1fa	working ok	2026-04-16 10:00:37 -04:00
bhetherman	fcf0be38bc	t5 encoder fp8 seems to be working	2026-04-12 13:50:34 -04:00
bhetherman	2818b41004	first stab at adding video	2026-04-12 04:11:52 -04:00
bhetherman	680c5b04cc	Pre-download the spacy model that kokoro needs at runtime	2026-04-08 11:43:26 -04:00
bhetherman	c07187aae1	add readme, remove torchvision	2026-04-08 04:08:42 -04:00
bhethermanandClaude Sonnet 4.6	263f39e0a3	Switch to CUDA 12.8 + ONNX-based VAD for RTX 5090 Blackwell support Upgrade PyTorch to 2.7+ with cu128 wheels for Blackwell (sm_120) GPU support. Replace silero-vad (which depends on torchaudio) with a direct ONNX Runtime implementation of the same Silero VAD model, eliminating the torchaudio dependency entirely. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 04:01:27 -04:00
bhetherman	0305f1dccd	updates for docker take 1	2026-04-08 03:05:26 -04:00