You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
42 lines
1.4 KiB
Markdown
42 lines
1.4 KiB
Markdown
# A Siri like AI Assistant
|
|
|
|
* Uses ChatGPT for general queries
|
|
* Uses Wolfram Alpha for anything math related
|
|
* Has built in NLP (using a NLI model) for determining if we can process query locally
|
|
* Frontend/Backend architecture for ability to deploy lightweight clients
|
|
|
|
|
|
## Skills
|
|
|
|
- [ ] Alarms
|
|
- [ ] Calendar
|
|
- [ ] Gmail
|
|
- [ ] ChatGPT
|
|
- [ ] Reminders
|
|
- [x] Timers - TODO: Adding in sound notifications.
|
|
- [ ] Todos
|
|
- [ ] Weather
|
|
- [ ] Wolfram
|
|
- [x] NLP
|
|
- [x] Speech to Text (frontend for sure)
|
|
- [ ] API
|
|
- [ ] Authentication
|
|
- [ ] General API
|
|
- [ ] TTS
|
|
- generate audio on backend or frontend?
|
|
- Perks of backend is fast generation
|
|
- Cons of backend is large file transfers between devices, lots of internet usage
|
|
- Perks of frontend is less data transfer between devices requiring less internet usage
|
|
- Cons of frontend is slower generation
|
|
- Current Solution: https://github.com/synesthesiam/opentts
|
|
|
|
|
|
## API Specs
|
|
|
|
Using websockets for communication allows for two way communication where the server can send the client info at any point
|
|
Link for example: https://stackoverflow.com/questions/53331127/python-websockets-send-to-client-and-keep-connection-alive
|
|
More examples (includes jwt authentication, though this is in node.js, still useful for figuring out how to do this stuff): https://www.linode.com/docs/guides/authenticating-over-websockets-with-jwt/
|
|
|
|
## Ideas
|
|
|
|
* Dashboard with api call counts |