samerbam 9cc2927beb
readd skills. with ifmain statements for individual file running
2023-09-14 10:34:27 -04:00
.gitea/workflows add gitea action, add todo 2023-09-11 22:51:57 -04:00
backend readd skills. with ifmain statements for individual file running 2023-09-14 10:34:27 -04:00
frontend Alarm inital copy from Timer, Inital Phone skill, readme updates 2023-09-06 20:26:19 -04:00
scratch added auth.py. renamed setup.sh to setup-mac.sh 2023-06-14 12:04:04 -04:00
.gitignore update .gitignore to ignore anything with config.py at end of name 2023-09-05 12:08:44 -04:00
README.md custom timeparser, 130 lines of the most annoying code. 2023-09-11 19:39:36 -04:00

A Siri-like AI Assistant

  • Uses ChatGPT (or alternative LLM) for general queries
  • Uses Wolfram Alpha API for anything math related
  • Has built-in NLP (using an NLI model) to determine whether a query can be processed locally (skills system)
  • Frontend/backend architecture, so lightweight clients can be deployed
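
The routing described above (handle a query with a local skill when possible, otherwise hand it to the LLM) might look roughly like the sketch below. Everything here is an assumption for illustration: the label names, the `route` function, the `0.5` threshold, and especially `keyword_scorer`, which is only a stand-in for the NLI model so the example runs without downloading anything.

```python
from typing import Callable, Dict

# Hypothetical skill labels; the real skills system may name these differently.
SKILL_LABELS = ["timer", "weather", "translation"]

# A scorer takes (query, label) and returns a confidence between 0 and 1.
Scorer = Callable[[str, str], float]


def keyword_scorer(query: str, label: str) -> float:
    """Stand-in for an NLI model: 1.0 if the label appears in the query, else 0.0."""
    return 1.0 if label in query.lower() else 0.0


def route(query: str, scorer: Scorer = keyword_scorer, threshold: float = 0.5) -> str:
    """Return the best-matching local skill, or "chatgpt" if nothing scores high enough."""
    scores: Dict[str, float] = {label: scorer(query, label) for label in SKILL_LABELS}
    best = max(scores, key=lambda label: scores[label])
    return best if scores[best] >= threshold else "chatgpt"


if __name__ == "__main__":
    print(route("Set a timer for 5 minutes"))
    print(route("Who wrote Dune?"))
```

Because `route` only depends on the `Scorer` signature, a real NLI-based scorer could be swapped in later without touching the routing logic.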

Skills

  • Translations
  • Alarms (potentially complete, if we use Timers logic)
  • Calendar
  • Gmail
  • ChatGPT
  • Reminders
  • Timers - TODO: add sound notifications.
  • Todos
  • Weather
  • Wolfram
  • NLP
  • Speech to Text (frontend for sure)
  • Phone
    • Initial implementation where the number is sent to the phone
    • NLP name matching to look up the contact
    • iCloud Contact API
  • API
    • Authentication
    • General API
  • TTS
    • Generate audio on the backend or the frontend?
      • Pro of backend: fast generation
      • Con of backend: large file transfers between devices and heavy internet usage
      • Pro of frontend: less data transfer between devices, so less internet usage
      • Con of frontend: slower generation
    • Current Solution: https://github.com/synesthesiam/opentts

API Specs

Using websockets for communication allows two-way communication, where the server can send the client information at any point.

Example: https://stackoverflow.com/questions/53331127/python-websockets-send-to-client-and-keep-connection-alive

More examples (including JWT authentication; these are in Node.js, but still useful for working out the approach): https://www.linode.com/docs/guides/authenticating-over-websockets-with-jwt/
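
Since the server can push at any time, the client needs a way to tell a reply to its own query apart from an unsolicited notification (for example, a timer going off). One possible approach, sketched below, is a small JSON envelope with a `type` field. The envelope shape and the `"response"` / `"event"` names are assumptions, not an agreed spec, and the sketch covers only message encoding and dispatch, not the websocket transport itself.

```python
import json
from typing import Any, Dict


def encode_message(kind: str, payload: Dict[str, Any]) -> str:
    """Wrap a payload in a JSON envelope that could be sent as a websocket text frame."""
    return json.dumps({"type": kind, "payload": payload})


def decode_message(raw: str) -> Dict[str, Any]:
    """Parse an envelope and reject anything that lacks the expected fields."""
    message = json.loads(raw)
    if not isinstance(message, dict) or "type" not in message or "payload" not in message:
        raise ValueError("message must be a JSON object with 'type' and 'payload'")
    return message


def handle_on_client(raw: str) -> str:
    """Client-side dispatch: replies and server-initiated events take different paths."""
    message = decode_message(raw)
    text = message["payload"].get("text", "")
    if message["type"] == "response":   # answer to a query the client sent
        return f"answer: {text}"
    if message["type"] == "event":      # pushed by the server on its own schedule
        return f"notification: {text}"
    return "ignored"


if __name__ == "__main__":
    print(handle_on_client(encode_message("event", {"text": "Timer finished"})))
```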

Ideas

  • Dashboard with API call counts (would require hooking into all active skills; callbacks with class inheritance, maybe?)
  • Phone calls from Jarvis speaker
  • JARVIS, initiate the House Party Protocol (takeover screen and show retro style text interface, possibly showing data from dashboard)
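
For the dashboard idea above, "callbacks with class inheritance" could be sketched as follows: a base class owns the public `run` method and fires a callback before delegating to the subclass. The `Skill`, `WeatherSkill`, and `Dashboard` names and the `on_call` parameter are hypothetical and are not taken from the existing backend.

```python
from collections import Counter
from typing import Callable, Optional


class Skill:
    """Hypothetical base class: subclasses implement handle(), callers use run()."""

    name = "skill"

    def __init__(self, on_call: Optional[Callable[[str], None]] = None) -> None:
        self._on_call = on_call

    def run(self, query: str) -> str:
        # Notify the observer (if any) before doing the actual work.
        if self._on_call is not None:
            self._on_call(self.name)
        return self.handle(query)

    def handle(self, query: str) -> str:
        raise NotImplementedError


class WeatherSkill(Skill):
    name = "weather"

    def handle(self, query: str) -> str:
        return "sunny"  # placeholder instead of a real API call


class Dashboard:
    """Collects per-skill call counts via the callback."""

    def __init__(self) -> None:
        self.counts: Counter = Counter()

    def record(self, skill_name: str) -> None:
        self.counts[skill_name] += 1


if __name__ == "__main__":
    dashboard = Dashboard()
    weather = WeatherSkill(on_call=dashboard.record)
    weather.run("weather today?")
    weather.run("weather tomorrow?")
    print(dict(dashboard.counts))
```

With this shape, individual skills never need to know about the dashboard; they only inherit `run` from the base class.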

Wants, but limitations prevent

  • (tumbleweed bounces by) Oh, dear.