17 Comments - Last post 16 minutes ago by artion33
5 Comments - Last post 17 minutes ago by DeliberateTaco
2,650 Comments - Last post 24 minutes ago by MeguminShiro
295 Comments - Last post 35 minutes ago by maddima
47,368 Comments - Last post 1 hour ago by drschnell
30 Comments - Last post 1 hour ago by Atombomb2097
1,535 Comments - Last post 1 hour ago by gszp
665 Comments - Last post 4 minutes ago by mourinhos86
94 Comments - Last post 33 minutes ago by davidpfarrell
60 Comments - Last post 1 hour ago by RosimInc
315 Comments - Last post 1 hour ago by CultofPersonalitea
3,952 Comments - Last post 1 hour ago by Midnight12891
31 Comments - Last post 2 hours ago by Csiki
8,621 Comments - Last post 2 hours ago by sassdrake
So, a mate of mine starts rambling about this project idea of his — keeping it vague for now 'cause he's gonna try and market it or something — but basically it’s gonna involve AI, voice, the works. Since I’ve been off work recovering from some delightful dental surgery (10/10 don’t recommend), he asked if I could whip up a basic offline AI to help with his prototype.
One week later, in between games and wrangling the kids, I’ve somehow ended up knee-deep in a full-on desktop AI assistant. I’m calling it Version 0.8 for now, with my “MVP” version being 1.0.
Right now it uses FFmpeg, Whisper, LLaMA3, and Coqui TTS. It handles both text and voice input/output, caches WAVs, convos, user settings, and has a few colour themes 'cause who doesn’t love a bit of flair. Currently working on per-conversation caching and trying to make convos reference each other — which is as fun as it sounds.
Also, the AI voice? Sounds like a half-baked call centre operator. Absolutely cooked. I’m adding more voice options soon so it stops sounding like a robo-Karen trying to upsell me internet plans.
Performance-wise, I’ve managed to take voice response from "go make a cuppa" times down to about 6–8 seconds, thanks to streaming chunked WAVs and throwing the GPU at it. Still not lightning, but hey, it’s no longer yelling into the void and waiting for enlightenment.
Anyway, point is — since I was putting together a train anyway, thought I’d ask: anyone got feature ideas? Already blown past what my mate expected, so I’ve got a pretty hefty roadmap going. But I’m all ears for wild suggestions, practical or ridiculous.
Here is your entry to a progressive train. Good Luck and Enjoy ^^
Just finalised the addition of allowing the creation of different conversations, user defined conversation titles, conversational tabbing, persistent / cached conversations and deleting conversations ^^ Currently the entire App is 755 Megabytes. Let's watch that expand >.<
Comment has been collapsed.