Commit Graph

15 Commits

Author SHA1 Message Date
bhetherman 44a10667c2 Enhance video handling and performance optimizations
- Added environment variables to prevent CPU thread pools from busy-waiting.
- Deferred loading of video models until first use to reduce VRAM footprint.
- Implemented streaming of speaking clips for improved responsiveness.
- Introduced a queue for managing speaking clips to handle multiple requests smoothly.
- Updated video playback logic to ensure proper handling of clip generation.
2026-04-24 00:36:18 -04:00
bhetherman 129df7d1fa working ok 2026-04-16 10:00:37 -04:00
bhetherman 9debc56137 Add LightX2V + Wan2.2-TI2V-5B-Turbo GGUF experiment
Benchmarks the dense 5B Turbo model (Q8_0 GGUF + fp8 T5) as a
lower-VRAM alternative to the 14B MoE pipeline. Includes dtype
patches for dense WanModel, Wan 2.2 VAE config (48 channels, 16x
spatial), and Blackwell fp8 workaround.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-04-16 01:27:45 -04:00
bhetherman 56923ff424 test passing 2026-04-12 16:38:44 -04:00
bhetherman fcf0be38bc t5 encoder fp8 seems to be working 2026-04-12 13:50:34 -04:00
bhetherman 2818b41004 first stab at adding video 2026-04-12 04:11:52 -04:00
bhetherman 680c5b04cc Pre-download the spacy model that kokoro needs at runtime 2026-04-08 11:43:26 -04:00
bhetherman d509f92a9d barge-in changes 2026-04-08 11:40:59 -04:00
bhetherman 175ed943df add KV-cache and move system prompt to the config 2026-04-08 10:25:03 -04:00
bhetherman c7c4019ecc add lm studio option 2026-04-08 10:17:20 -04:00
bhetherman 08c5757b31 vad fix 2026-04-08 04:42:12 -04:00
bhetherman c07187aae1 add readme, remove torchvision 2026-04-08 04:08:42 -04:00
bhetherman 263f39e0a3 Switch to CUDA 12.8 + ONNX-based VAD for RTX 5090 Blackwell support
Upgrade PyTorch to 2.7+ with cu128 wheels for Blackwell (sm_120) GPU
support. Replace silero-vad (which depends on torchaudio) with a direct
ONNX Runtime implementation of the same Silero VAD model, eliminating
the torchaudio dependency entirely.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-08 04:01:27 -04:00
bhetherman 0305f1dccd updates for docker take 1 2026-04-08 03:05:26 -04:00
bhetherman ce41bca422 initial commit 2026-04-07 03:58:35 -04:00
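
Note on 44a10667c2: the message says video models are loaded on first use rather than at startup, to reduce the VRAM footprint. The sketch below shows one common way to do that. It is not taken from the repository: `LazyModel`, `fake_video_loader`, and `load_calls` are hypothetical names, and the loader is a stand-in that returns a string instead of allocating a real model.

```python
import threading
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


class LazyModel(Generic[T]):
    """Hypothetical helper: keeps a loader and runs it only on first access."""

    def __init__(self, loader: Callable[[], T]) -> None:
        self._loader = loader
        self._model: Optional[T] = None
        self._loaded = False
        self._lock = threading.Lock()

    @property
    def loaded(self) -> bool:
        # True once the loader has run; nothing is allocated before that.
        return self._loaded

    def get(self) -> T:
        # Double-checked locking so concurrent first requests load only once.
        if not self._loaded:
            with self._lock:
                if not self._loaded:
                    self._model = self._loader()
                    self._loaded = True
        return self._model  # type: ignore[return-value]


load_calls = []


def fake_video_loader() -> str:
    # Stand-in for an expensive load; a real loader would allocate VRAM here.
    load_calls.append(1)
    return "video-model"


# Constructing the wrapper is cheap; the loader has not run yet.
video_model = LazyModel(fake_video_loader)
```

Calling `video_model.get()` from the request path triggers the load once, and later calls reuse the same object. The lock matters only if several requests can arrive before the first load finishes.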