Jarvis: an Experiment Leveraging the OpenAI API, Apple Homekit, and Siri

Modular, tailored to my needs, and an entire lot more useful than Siri

I’m a pc nerd, and I at all times have been, so when ChatGPT landed within the news I immediately got enthusiastic about the chances. As an analyst/business owner, I wondered if it could possibly be used to make my team’s and client’s lives easier … but figured I needed some experience with the technology first.

Like everyone else, I began asking Chat GPT questions, mostly mundane stuff, and that was fun. Then I discovered how good the platform is at helping people write code!

Last 12 months I built a Python and JavaScript system for keeping track of automotive collections called the Auto Asset Manager. It really works advantageous, but since I used to be latest to Python and my JavaScript was pretty rusty, quite a lot of the code was suboptimal. I used Chat GPT to repair some stuff and it was good! I might just say “what’s one of the best option to access information in a multidimensional array in Python” and it could mainly send me the precise code I needed.

I loved it.

But then I saw this post on combining OpenAI with Apple Homekit via Siri and was inspired. I take advantage of Homekit so much, but truthfully, I don’t use Siri as a part of it because, well, she’s not very sophisticated.

But Jarvis is.

I began with Mate’s shortcut and kind of ended up re-writing it to be a) more modular and b) more tailored to my needs. I ended up with a command shortcut called “Get Jarvis” and three child shortcuts to handle commands, queries, and the necessity to translate the responses different devices provide. I needed the translator because, like anyone with Homekit stuff, I run a plethora of systems using Homebridge and every system seems to present answers a unique way (e.g., Meross says a switch is “on” or “off” but Phillips says “yes” light is on or “no” it’s not and Remootio for my gate indicates “1” for open and “0” for closed, etc.)

Here’s a short video of what Jarvis is in a position to do for me (using Siri as a baseline for what is feasible today):