live-voice-chat

14 Commits

Author	SHA1	Message	Date
bhetherman	129df7d1fa	working ok	2026-04-16 10:00:37 -04:00
bhetherman and Claude Opus 4.6	9debc56137	Add LightX2V + Wan2.2-TI2V-5B-Turbo GGUF experiment. Benchmarks the dense 5B Turbo model (Q8_0 GGUF + fp8 T5) as a lower-VRAM alternative to the 14B MoE pipeline. Includes dtype patches for dense WanModel, Wan 2.2 VAE config (48 channels, 16x spatial), and Blackwell fp8 workaround. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 01:27:45 -04:00
bhetherman	56923ff424	test passing	2026-04-12 16:38:44 -04:00
bhetherman	fcf0be38bc	t5 encoder fp8 seems to be working	2026-04-12 13:50:34 -04:00
bhetherman	2818b41004	first stab at adding video	2026-04-12 04:11:52 -04:00
bhetherman	680c5b04cc	Pre-download the spacy model that kokoro needs at runtime	2026-04-08 11:43:26 -04:00
bhetherman	d509f92a9d	barge-in changes	2026-04-08 11:40:59 -04:00
bhetherman	175ed943df	add KV-cache and move system prompt to the config	2026-04-08 10:25:03 -04:00
bhetherman	c7c4019ecc	add lm studio option	2026-04-08 10:17:20 -04:00
bhetherman	08c5757b31	vad fix	2026-04-08 04:42:12 -04:00
bhetherman	c07187aae1	add readme, remove torchvision	2026-04-08 04:08:42 -04:00
bhetherman and Claude Sonnet 4.6	263f39e0a3	Switch to CUDA 12.8 + ONNX-based VAD for RTX 5090 Blackwell support. Upgrade PyTorch to 2.7+ with cu128 wheels for Blackwell (sm_120) GPU support. Replace silero-vad (which depends on torchaudio) with a direct ONNX Runtime implementation of the same Silero VAD model, eliminating the torchaudio dependency entirely. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 04:01:27 -04:00
bhetherman	0305f1dccd	updates for docker take 1	2026-04-08 03:05:26 -04:00
bhetherman	ce41bca422	initial commit	2026-04-07 03:58:35 -04:00