How this visually impaired engineer uses Claude Code to make his life more accessible | Joe McCormick
Joe McCormick is a principal software engineer at Babylist who lost most of his central vision due to a rare genetic disorder right before starting college. He pivoted from mechanical engineering to computer science and now leads AI enablement at Babylist. Joe demonstrates how he uses AI to build micro Chrome extensions that make his everyday work and life more accessible, showing how personal software can address accessibility needs that mainstream products often overlook. What you’ll learn: How to build custom Chrome extensions in under 25 minutes using Claude Code A practical workflow for creating AI-powered accessibility tools How to use Claude Skills to accelerate repetitive development tasks Techniques for making Claude Code more screen reader accessible Why personal software is becoming increasingly viable with AI assistance How multimodal AI is transforming accessibility for visually impaired users — Brought to you by: Tines —Start building intelligent workflows today — Detailed workflow walkthroughs from this episode: • How I AI: Building Custom AI Accessibility Tools for Slack with Joe McCormick & Claude Code: https://www.chatprd.ai/how-i-ai/custom-ai-accessibility-tools-for-slack-claude-code • Build a Slack Link Summarizer from Scratch using Claude Code: https://www.chatprd.ai/how-i-ai/workflows/slack-link-summarizer-using-claude-code • Create a Fast, Accessible AI Spell Checker for Any Website: https://www.chatprd.ai/how-i-ai/workflows/accessible-ai-spell-checker-for-any-website
- Published
- Published Feb 16, 2026
- Uploaded
- Uploaded Jun 12, 2026
- File type
- POD
- Queried
- 00
- Source
- podcasters.spotify.com
Full transcript
Showing the full transcript for this episode.
AI-generated transcript with timestamped sections.
[00:00] Right before I started college, I ended up losing most of my central vision due to a rare genetic disorder called Lieber's Hereditary Optic Neuropathy. I was talking with someone who was losing their sight recently from the same disease, and they were asking about different things. [00:13] And I was like, oh, you can just do all of that now. With Gemini Arch, it has to be T. The world is a whole lot easier. So you're going to show us some of the things that you've built for yourself. So when someone sends me an image, I use this tool to be able to get the gist of an image without needing to ask somebody to explain it to me. If I hit Control Shift D. [00:31] on any message, it's gonna pop up and go off and describe that image for me. And the cool thing is I can go ask some follow ups. What age child is this for? [00:41] And we'll head off to Chachapiche and get the response for this as well. I'm curious for you, what are you most excited about in the multimodal world of AI? One thing that I was always afraid of, can I read stories? I can memorize stories, I can tell stories. But your son being like, I want to read this book. And you having to be like, sorry, I can't. And now that sorry, I can't becomes sorry, I can with the assistance of so many different tools now.
[01:05] Welcome back to How I AI. [01:11] you build better with these new tools. Today we have Joe McCormick, Principal Software Engineer at BabyList, who has a vision impairment and he's going to show us [01:20] how he uses AI to build microchrome apps to make his everyday life and work [01:25] a lot more accessible. You're going to learn how to use Claude code [01:29] to write Chrome apps, and you're going to be inspired at the little things you can do to make your own Slack a little bit more efficient. Let's get to it. This episode is brought to you by Tynes, the intelligent workflow platform powering the world's most important work. Business moves faster than the systems meant to support it. Teams are stuck with repetitive tasks, scattered tools, and hard to reach data. AI has huge promise, but struggles when everything underneath is fragmented. [01:59] unifies your tools, data, and processes in one secure, flexible platform, blending a JETTIC AI, automation, and human-led intervention. Teams get their time back, workflows run smarter, and AI actually delivers real value. Customers now automate over 1.5 billion actions every week. [02:29] at times.com slash how I AI. [02:34] Jo, thanks for joining How I AI. And I want you to spend a little bit of time introducing yourself and your story and how AI has impacted your ability to do work and build interesting things and engage in lots of awesome projects. And what's different about your life now with AI versus before? So, yeah, my name is Jo McCormick.
[03:00] I'm a principal software engineer at BabyList, and I think I took [03:04] maybe a little bit more interesting journey than most into the computer science world. [03:08] Um, so right before I started college, I ended up losing most of my central vision. [03:14] due to a rare genetic disorder? [03:16] called Lieber's Hereditary Optic Neuropathy. [03:19] And so before starting at Harvard, I was more interested into the mechanical world and kind of robotics and everything in that space. And then, uh. [03:28] found that that was a lot harder and doing things with my hands was becoming a lot harder month after month. [03:34] And so I took the intro to computer science course at Harvard and immediately fell in love and found that [03:40] I got the same feeling of creativity and being able to come up with the idea and make it happen. But now I was on [03:46] Uh, maybe not a full equal plane to my competitors at that time or my, my, uh, uh, other students, but. [03:52] then obviously as AI took off, became even more equivalent and [03:56] The gap between, um, I think software engineer for a sighted person individually impaired person is, is closing, um, day by day. Um. [04:04] And also in my personal life, I think it's [04:06] even been extra impactful. [04:08] I was talking with someone who was losing their sight. [04:11] recently from the same disease and they were asking about different things and I was like, [04:15] Oh, you can just do all of that now with sharing your, your stream with Gemini or ChatWPT. Whereas, uh, when I was, uh, first losing my faith, it was using different notification tools or, or even like. [04:25] glasses and things and it's like now [04:27] the world is a whole lot easier. I'm an avid meta glass user, um, and, and different things. My personal life a lot easier as well. Um,
[04:35] But yeah. [04:35] I do lots of AI product engineering now and I at BabyList [04:40] lead the AI enablement and trying to make sure all of our software engineers can [04:46] build with AI as practically as possible at all different parts of the software development lifecycle. [04:50] So you figured out a way, one, to adjust your interest in engineering to something that's a little bit more accessible for you. And then, two, lean into how these AI tools can really increase the accessibility and user experience of supportive technology that you've maybe used in the past, but that you've been able to make accessible. [05:11] better yourself and what I love about this personal software moment that we're in right now which is [05:18] Unfortunately, accessibility software and custom software that meets the needs of a lot of people [05:25] is simply in some instances not an economically viable business, for example, to build. And so in the kind of broader economic world, there's not a lot of incentive to build a full set [05:38] of robust tools that can meet the needs of everybody who deserves to have their needs meeting and [05:43] needs met. [05:45] And what I love about what we're able to do now with AI is not only work [05:50] are more interesting sort of accessibility tools and [05:55] and platforms being able to be built, but people can build these solutions for themselves and they can be very customized to [06:01] your experience, your needs, your strengths. And I think that's a really underappreciated
[06:07] benefit of [06:09] And so you're going to show us some of the things that you've built [06:13] for yourself and you're actually gonna walk us through your coding flow, which I think is really awesome on how to build one of these tools [06:20] So we can follow a step by step. - For sure. So yeah, we can jump right in. I'll show off two that I built myself ahead of time, and we're gonna do one on the fly. [06:30] And I do think personal software is going to be huge. [06:33] One reason why I like building some of these, I'm going to show off a couple of Chrome extensions that I've worked on. [06:39] And one thing I like about building some of these is compared to maybe some of the offerings we have today from the AI native browsers. [06:46] is any browsers are great. Uh, like I do use comment, um, [06:50] But it's using the... [06:52] the Swiss army knife and it does everything. But in order to do that, some of its processes are definitely slower. [06:57] which there are certain things that's totally fine for. [07:00] But there's other steps where you want it to be really quick and you want to use the drill instead of the little tiny screw driver that came with this was army knife. [07:07] And so I'll show off a couple that I've already built, and then we'll jump into one that's not necessarily even accessibility only, but I think everyone could benefit from. [07:14] So I'm going to hop into Slack now. BabyList, where I work, is a big Slack company. So lots of my stuff is very Slack based because it's where I spend a lot of my non-coding time. [07:24] I'm gonna hop in here and I'll share and show a couple ones I've built so far. [07:28] This is a little temporary Slack channel that we have for this with some example messages that were actual messages sent by my colleagues. I just had them resend it into here. [07:37] So first one I'm going to demo is a image description tool.
[07:41] So when someone sends me an image, [07:43] I use a screen magnifier, so I typically am looking at my screen at about 10x zoom. [07:49] But it's not the easiest to do. And I prefer to not have to always be paying attention to that if possible. [07:56] So I use this tool, but to show off to be able to get the gist of an image without needing to ask somebody to explain to me or me after naturally use my eyes to do it. [08:05] So I have a shortcut in Slack. If I hit, I'm on Windows, so Control Shift D. [08:10] on any message. [08:11] It's going to pop up and go off and describe that image for me. [08:15] and tell me a little description of it so I can see, Hey, it shows a modern infant baby stroller with a car seat. Um, it's got a canopy. It's got all details about this. And the cool thing is I can go ask some followups so I can say, um, what age child is this for? [08:34] And we'll head off to Chachapiti and get the response for this as well. [08:39] And so we can get our answers there. And it's just a nice way for me to not have to necessarily push back and work with people and get some answers to my questions as as I go on this, too. [08:47] And I think this is something that folks don't really appreciate, which is for you, [08:53] You know, you have the ability to zoom in, look at this, but it's just from a time perspective, probably a lot more tedious for you to do. And so folks have thought a lot about image to description in terms of generating metadata for images. [09:09] Ecommerce sites, which I'm sure you all think about a lot, or we had an episode with a documentary producer that highlighted using image to text descriptions plus metadata.
[09:22] to organize archival footage a lot easier. But this is a great example of [09:27] image to text being just a much more efficient image [09:31] information transfer method for someone like you who might need to parse this information differently. And then what I love about this is you could have just done the image description, right? Just what is this image and tell me. [09:45] But you actually were able to go that next step and say, great, if I need to query more information about this to understand more context, [09:53] you make that really, really... [09:54] easy. So I love I just I love this example. And you you built this all yourself? [09:58] Yeah, this was probably 25 minutes of a cloud code session. It was pretty straightforward. Awesome. A flavor of this one I'm working on right now is a version that works in Figma directly as well. But given anything, my node will explain it to me with a much different prompt. [10:15] Right. In the Figma case, I want to hear about the colors of the CTAs. I want to hear about all this stuff because I am. [10:20] a full stack engineer. And so you can get all that out of Figma, but it's lots of clicks and lots of different steps and being able to just hit one keyboard shortcut and find out what this design is really accomplishing. [10:31] is going to be a nice, easy win for me. And that one is just about done as well. [10:35] It's got a little bit of bug, so it's not ready to fully demo, but that's one that I'm excited about for as well. [10:40] So before we go into building one of these, are there a couple other extensions that you've built just as inspiration for folks? [10:47] watching or listening that you think are really interesting to show? [10:50] Yeah, another one that's not necessarily accessibility focused, but it's a cool one. So I
[10:55] uh i'm not the best typer in the world i i don't even think that's my vision's fault um i just i i think i'd like to think i'm a touch typer but i think my brain goes faster than my fingers sometimes [11:05] So I have one that I built that is just like a really easy spell checker. There's lots of tools that do this grammarly and all this, but simply, they're not all screen reader accessible. They're multiple clicks away. Sometimes, um, [11:18] And so I built one out that works in any input field on the web. [11:23] I'll demo it here. I'm going to say test. [11:26] Testing typos in the message. [11:33] And then if I hit Control Shift S here on this one, [11:37] This is going to go off, send that off to open AI and come back with, with that. And while it was doing that for me on my screen reader, it said like processing spell check, spell check complete. [11:47] And so I know when I'm writing a message, I don't need to necessarily worry about all the polish on it. [11:52] I can just do that. I hear spellcheck complete. I'm ready. I go off, hit that and send it off to people. And I have a prompt there that's basically like, [12:00] Do not change any of the words, just fix typos, like really hyper-focused to make sure that it's [12:04] the content that I wrote, but just what the typos corrected it. [12:07] So I'm leaning in and smiling a lot, one, because if you've been watching How I AI, you haven't heard about my fancy nails. I'm a fancy nail gal. And with these fancy nails, I cannot type anything. It is all typos. And so this is such a great little workflow that you built for yourself that I'm going to steal. The two things I want to call out for people who are watching or not watching the details is...
[12:35] you are actually running Slack right now in Chrome. So at first I was like, wait, how are all these apps interacting with the Chrome desktop app? But you're running Slack in Chrome, which means that these are all extensions that are available to you to interact with content in Slack and make modifications via Slack. [12:54] a Chrome extension. So I think that's a really interesting hack for folks that are like, okay, I can't hack my way into the desktop app. [13:01] But I can load Slack in the browser. And then on top of that, add a browser extension, [13:10] that can do these interesting things for me. The second thing I love is how you're using so many keyboard shortcuts to trigger these [13:19] micro apps. And again, this is about efficiency. I always say in these AI products, that latency is the killer feature. And so anything you can do from a UX perspective, [13:31] or from... [13:33] a performance perspective to make these little apps more efficient. [13:38] the better they're going to feel. And so I love that you, you know, type a couple keys and you get a fully corrected sentence here right in your browser. It's a great idea. [13:46] Yeah, the first version I had of this was just using the like chat GPT, uh, alt space shortcut that would open up that little like mini chat window. And I had like a saved, um, [13:56] uh just custom gpt to do it and then i was like well why am i jumping out of where am i actually working [14:02] I can save two steps in like three clicks here.
[14:05] And so I think it's almost like piloting it first without doing this and then realizing, oh, yeah, there's a better way. [14:10] And I think one thing about virtual software is the [14:13] return on investment became so much faster. Like before you'd have an idea like that, you'd like [14:18] This is going to save me like three minutes a day. [14:20] But it's going to take me three days to build. So the payback period is just not totally there. And now it's like, it saved me three minutes a day. [14:26] And it takes me 30 minutes to build like the payback period has just become insane for a lot of this tooling. [14:32] I love it. So let's build one. I want to see what your flow is for actually building one of these things. Yeah. So before we build it, I'm gonna talk about what I want to build. So one thing that comes up a lot in the babyless Slack world and probably in many other companies Slacks is people send links. [14:48] all the time. [14:49] And for me, I often just hit like the save for later button. And then maybe at the end of the week, I like decide if I want to read them. [14:56] But I've realized that like [14:58] Maybe it's the, the, do the thing that takes you one minute, uh, in a moment instead of keep deferring it. I think it would be great if there was an easy shortcut where I could have an AI go off. [15:08] and fetch this. [15:09] article, give me the key takeaways, and then I'll decide, do I want to actually do a full read and save for later? Or do I just skip it in the moment? [15:16] and have that all work in [15:18] under 5 to 10 seconds. I think it's a much more powerful than, uh, [15:22] deferring and having this big to do list at the end of the week when I want to catch up on all these messages. So you're going to show us how how you built this. And what I have to say is all of us are so overwhelmed with so much context and links and docs. [15:35] And yeah, you know, I would see something like this, whether it's
[15:39] a partner or a competitor or just something somebody found that was interesting and you want to go oh yeah I should definitely read that but should you should you definitely read it so this quick summarization is is a great idea and so you're going to walk us through. [15:55] how to actually code this up using Cloud Code, I believe. [15:58] Yeah. And I'll jump in along the way with a couple like cloud code tweaks that I've made, or at least lean into to try and make it a little more screen reader accessible. [16:06] But again, I similarly think that lots of things that are good, cool screen readers probably are any good tools for everybody across the board. [16:12] Great. So let's jump right in. [16:14] So I'm going to switch over. [16:16] to my terminal for just a second so I can initialize our project. So I'm gonna run a make dir command to get our repo setup. [16:27] So we have our Slack summary extension. I'm making that directory. [16:31] And I should have open up Claude code quick on this. [16:35] Should I open up BS code on this? [16:37] So VS Code opening and initializing here. [16:40] I'm gonna open it up as big as I can. [16:44] And we are going to jump in as this finishes loading. [16:47] So I'm going to start here. [16:49] by making a PRD like every good how a podcast goes. That's exactly right. [16:55] And I'm going to do this with audio. So [16:58] Sometimes I'll use with for flow for what I'm doing, but in this case, I actually find the BS code copilot audio to be pretty good. [17:05] And so if I'm just doing something kind of quick like this, I'll just end up using the copilot integration.
[17:11] So you'll see I can do a control I and then when I hit control I again, it's going to dictate. So I'm not going to I'm going to pause my talking and switch in that mode. And I'm going to dictate out for us a little bit of this PRD. [17:22] and then see what it comes up with for us. [17:24] We want to build a simple PRD for a locally run Chrome extension. [17:30] "Whose job is it?" [17:32] to [17:33] exist in Slack alone, [17:35] And when focused on a Slack message, [17:38] you can hit the keyboard shortcut [17:40] Control Shift 1 [17:43] And it will search that message to find any external links. [17:47] If there are external links found, [17:49] It should open them up in hidden tabs. [17:52] extract their content, [17:54] and send it off to OpenAI to summarize. [17:57] And we'll see here, I just finished that. It's going to go off and quickly generate a small PRD for that. [18:02] This doesn't take very long at all. [18:04] I'm going to accept all the changes because I find reading in that diff view to be particularly painful with a screen reader. [18:09] Um, it's not like terrible, but it's much easier just to read it in the document. And since there's a new document being made, I'll now look at it here and we'll see how this looks. [18:17] So we have our goals, we want some privacy security, makes sense. We have some user stories here. We have some functional requirements. It's gonna parse, it's all making sense so far. It's a non-functional, and it's not a scope. [18:31] There were images. Yeah. I already demoed my image processing and so open questions. [18:35] I where and how should we summarize makes sense. We don't need success metrics for this. It's internal. So let's answer some of these questions. So let's just select it all and add it. I'm going to dictate again.
[18:46] So I can hick for a while and I'll start. [18:49] We want to build a very simple PRD here. [18:52] for a locally run Chrome extension? [18:55] where the job is when in Slack [18:58] in Chrome. [18:59] and hovering over a message that has focus. [19:03] You can run the keyboard shortcut [19:05] Control Shift 1. [19:08] And that will [19:10] Look for any external links in the message. [19:13] If any are found, open them up in hidden tabs and extract that content and send it over to OpenAI. [19:20] When in OpenAI, we should summarize them, [19:23] and extract three to five key takeaways from the article. [19:27] and return those to the user in a fully screen reader accessible modal [19:32] which includes the article's title, [19:34] and a link out in a new tab, [19:37] to view the article. [19:39] Okay. We now have this generating the PRD. [19:42] So we can see you talk to our keyboard shortcut. It's got our goals. [19:46] minimize your effort, extreme accessibility is key for this. [19:49] Some basic user stories. [19:51] some punctual requirements. [19:54] Cool, this looks good. - And one thing I have to call out here is, [19:59] You're an engineer, and that was a pretty good product description, and that resulted in a pretty good PRD. So one of the things I like about AI is, as I say, there are no lanes. If you are an engineer and you have an idea, you can write. [20:13] a very good PRD using a little bit of AI assistance. If you need a tip, I know one or two tools that can help you with it. Yeah. And this is no, yeah, no custom purity prompts or anything like that. This is just mostly the foundational models work here. Yeah.
[20:29] Cool. So now we're going to hop into a new tab, spin up clod code in here. [20:33] And ahead of this, as I mentioned, I built a couple of these Chrome extensions. So because I've done a few of them, I ended up building out a clod skill to help me build more Chrome extensions. So after I built the first two, [20:46] I had Claude look at both and figure out what was the patterns that were common across them. [20:51] and work on a skill so I could just build the third, the fourth, the fifth. [20:55] in a much simpler version and extract out that common piece. [20:59] I have found Claude skills to be a mixed bag in terms of actually being picked up automatically. Um, so I am going to explicitly say like, use the skill. [21:06] But technically you're not supposed to need to do that, but I definitely found that that's a buried in its approach. [21:11] So I'm going to make a prompt here. It's going to more request the skill. [21:14] And for folks that are wondering how to set up their own skills, we do have a How I A episode. It is Introduction to Quad Skills, where I explain how... [21:23] That Claude Skills. [21:25] our files [21:26] in a folder. Sometimes they're zipped files in a folder. So if they seem mysterious to you, go check out that mini episode from, I think it was October or November, and learn how to make your own clod skills. They're pretty useful. [21:39] Yeah. So in here, the prompt is just going to be, so at first at mention the PRD. So we have that. [21:44] Thank you. [21:45] And we'll say use the clawed skill [21:51] for creating Chrome extensions to [21:56] build out this PRD.
[22:00] And one thing, um, cruel, uh, cloud code has added this feature where you can edit a prompt [22:06] instead of just in the terminal, in a code file. And so in Cloud Code, if you hit control plus G, [22:13] It will open that prompt in a text editor. So especially for me, where navigating that terminal is not super screen reader friendly. I now am navigating it in the same place that I write code on a day-to-day basis, which is very screen reader accessible. [22:26] And so again, other people may find this to be useful. You can craft deeper prompts. You can like control F in here. You can do whatever. It's just a file. [22:33] And so I think it's a really useful tool they added a few versions ago. [22:36] to make it a little bit easier to work with. [22:38] And I'm just going to put a note here. Use my open AI key from my shared Chrome extension. The config. I've had some [22:51] I don't want to keep pasting my open AI keys and stuff like that. So I end up pulling out some shared config to share across all my Chrome extensions. That way I don't need to rinse or repeat that step over and over again. [23:00] So I save this file now, and now whenever I close this, it's going to replace my prompt in Cloud Code with that completed prompt. [23:08] Oh, interesting. [23:10] Yeah. So you get super effective, especially as you want to do a deeper prompt to use this and not have to worry about the whole terminal side of things. [23:16] Cool. Now we're going to kick this off. [23:18] And so I do have this cloud code session right now in planning mode. You'll see it's requesting use the skill, which is great. We want to use our skill. I want to shrink this terminal so we can see more clogs and really need this as much. Okay, it's going to run some commands that actually pull in that cloud that prom extension config that I talked about.
[23:37] Another thing I have to make cloud code a little more accessible is, again, I'm not necessarily seeing everything that pops in as it's going. [23:45] And so I set up a Claude hook that whenever Claude needs user input, [23:50] it will [23:52] basically like ding a bell on my computer so I could hear a sound that is like, Oh, Joe, you need to do something right now. [24:00] to work on this. [24:01] I actually don't want [24:03] cloud code to read this file. So I'm going to say no, because this is, is some secret configuration. So I'm going to say, no, don't read that file. Uh, now that has a API key and don't read it. If you need. [24:19] You then use JQ to extract the keys. So again, luckily I am a engineer. So it's not like fully vibe coding this, uh, from scratch. I know that there's a utility that will extract just the keys out of a JSON object. [24:33] And Claude won't be able to see the values, which is the actual secrets there. So now here we are, it's just going to make a sim link to this extension for us. So we don't need to worry about the configuration. And if I ever do change it, it will automatically update in all my extensions. [24:48] So instead of having each one of their own API key and my key becomes invalidated for some reason, um, having to update all of them, I use this, uh, concept called a symbolic link. So just link the same conflict file into all my extensions. [25:00] So one change fixes everywhere. [25:02] Yeah, and this is one of those things that's just easy to do when you're running these things. [25:06] locally or just building stuff for yourself is you just make the maintenance and
[25:12] the maintenance and deploy of these really easy for yourself and make it as simple as possible for you to repeat building things and using the same, for example, API keys. And, you know, when you want to share all these publicly and publish them to the Chrome extension marketplace, we can do a little cleanup. [25:30] Exactly. [25:31] So here's our plan. And just like before, control G also works to edit plans in the editor. [25:38] So similarly, this is going to be a pain to read in this terminal for a screen reader. [25:42] But also if I want to make a tiny tweak to one thing here, I don't need to worry about telling Claude to update it or write it to a file. [25:48] I just hit Control G. [25:50] opens it up in this file. And now we have our full plan here. And you can just tweak different parts of it if you want and, and modify it. So it's another great usage of the Ctrl G shortcut. Yeah, I want to call this out for people because so many folks would get something like this. And then if it was wrong, [26:06] kind of say, no, this is wrong. Please update XYZ or ABC. And, you know, you're calling out, not only is that a pretty inefficient way for you to, [26:16] to interact with this file in terms of accessibility and your need for a screen reader [26:21] But it's also just not the fastest way to give it feedback. And so your ability to just take this [26:27] move it into code, use this control G, I believe, [26:32] edit it, [26:32] close it out, run it, is just, again, a lot more efficient. [26:37] Yeah, so we'll do a quick one through this plan. [26:40] I'm using now again, some more keyboard shortcuts just to break it down and, and, uh, fold the markdown headings. I can, from my perspective, it's hard for me to visually, or I don't visually scan the page.
[26:50] So reading through a big file, I typically in code or markdown rely on folding so I can collapse different sections and read them. [26:57] and then expand over the sections I care about. [27:00] So I don't care about certain aspects here. [27:05] But like, [27:06] Maybe I want to get deep into the error handling piece. [27:09] So I'll expand that section and just read this part. [27:14] logging, some key patterns, but this plan generally looks good to me. So now I will just save this. [27:20] And again, close the file. And that is what is over here in the prompt. Cause I didn't modify it. It didn't take any time. It was just ready. But if I modified it, it would take a split second. We'll load that new plan in and then it moves forward. [27:30] Yeah, and I just have to call this again out again for people who are maybe listening to the podcast or again are not paying attention to what it means to use a screen reader here, which is you got your little your little headphone plugged in right now. And I am so impressed that you're using the screen reader while walking us through this demo. [27:51] so fascinating about watching your workflow here is [27:55] It's super efficient and very efficient. [27:58] fast, [27:59] even if you don't take into account you're using a screen reader. So the fact that you've been able to build these shortcuts, these tools, [28:07] use Claude code in a more effective way [28:11] And then you add on this layer of, and it makes using this kind of screen reader. [28:16] a lot more accessible to you is just very impressive. And I don't want people to miss that there's this invisible layer that we don't get to see or hear right now that you're also putting in between this, which adds a little bit of micro friction.
[28:29] Yeah, and I think one thing that's great about cloud code, so [28:32] Like right now, visually, one of the options is selected in blue. I don't necessarily know which one it is and using the arrow keys, it does not tell me with the screen reader what's selected. [28:41] But Cloud Code has done a great job of standardizing where one means yes, [28:45] Two means often yes, but like [28:49] manually with a variation and then three is like no or type something extra. [28:54] And so I can, uh, I can basically instead of using the arrow keys and enter, I can just be like, yeah, I want, I want to just move forward here. I'm sure I hit the link to number one. [29:01] And so they've done a good job of using, I think, lots of different inputs and lots of ways to make this a little more accessible as it goes as well. [29:07] Yep. And so that consistency, again, maybe this is for folks that are building AI products and trying to reinforce workflows, especially for folks that are building maybe these terminal UIs that I think are really lovely and interesting to build is... [29:22] You want, you know, people love the terminal because it's so fast and you want it to both be performance fast, but you also want it to be. [29:30] UI UX fast, which is if your user always knows one is X, two is Y, [29:35] 3 is Z. [29:37] then they can consistently use these keyboard patterns of one key or two keys to efficiently get through... [29:43] your UI. And I think, you know, taking that [29:46] mental friction, that cognitive friction off a user [29:50] by driving consistency and patterns that people can either explicitly or implicitly learn [29:55] is a really useful tool when you're using UI that is
[30:00] more constrained. [30:02] Because again, a terminal UI is actually constrained to basically text. [30:07] Yeah, and Cloud Code has been working on and has released a BSCode extension that is more of a GUI. [30:13] I've just found that it's a little bit lacking behind some of the latest and greatest features. And I'm like, I want everything immediately. So I'm able to get spoiled. But I think that's going to catch up as we go to and maybe potentially more screen reader accessible option for some of these things, too. Yeah. And again, we love a beep boop at the end of an agent completion. So I love I love the cursor sound. I love that you're using one here for for Claude Code. [30:43] stream of what now that's too much. I think it's a big difference too, between like vibe coding things and, um, like production quality code, right? The final output here is just for me, it's gonna run in my Chrome. [30:55] I don't really care what the code looks like at all. Um, versus when I am building software for my, my, uh, full-time day job, uh, and actually building stuff that's going to be in the hands of, [31:04] millions of users and many developers. [31:07] I do do things a lot differently. The plan I'm going to read very detailed what it's going to actually do. [31:13] um, the, the code I'm going to be reading, I'm going to be doing smaller commits and reviewing it, uh, kind of chunk by chunk and getting much more detailed. [31:20] In this case, [31:21] Yeah, when the final output is just a user of one, the code quality is a lot less important. Yeah, what matters is does it work? [31:29] Yes, exactly. It doesn't work in a little bit of like, did you leak your API key? Those are the two things we want to worry about. But other than that, we are we're on our
[31:41] on our way. And you know, the other thing that I think is really fun here is because you've built [31:48] I think the idea that you pick a platform or a framework for a set of your [31:55] personal software and then [31:57] establish best practices through a skill, and then just rinse and repeat for other use cases, is a really good way to get super fluent with some of these AI tools. And so I've seen a lot of people say, [32:10] All I do is build [32:12] markdown based repos for my documentation and everything else. Everything I build is just a markdown based repo. And then I've gotten really good at using cursor for this. [32:22] And then you have this example of [32:24] So not everything, I'm sure, but a lot of what I build are going to be Chrome extensions. So I'm going to make this framework, get it going, and then I can get really good at cloud code because I'm not relearning a technology on top of relearning a tool. And so I do think you get compounding effects by staying in the same technical space when you're trying to learn these via coding tools because you're not trying to learn on two fronts. You're trying to learn on just the tooling front. [32:54] established. For sure. And I think as we talked about before, when we were starting like return on investment gets better and better because [33:01] Building this one as my third takes [33:03] Half the time is the first one. And when I build the fifth, I think it's had to take a fifth of the time of the first one, like. [33:08] I am getting better. The clod skill each time I build one, I'm just gonna feed it back in and be like,
[33:12] What was the skill missing? Like make, make it better. [33:15] And so I think it makes the return investment may turn into be like two days of payback period or something crazy. [33:21] So yeah, it does feel like [33:23] It's cool to, I think, spread out and try many different things. But it does also just feel great to be like, I have an idea. It is in my hands in under 30 minutes. Like, it's just very cool. [33:34] Yeah. And again, this is one of those things where I tell people to work on their anti-to-do list. And when you have [33:42] a recurring task or a recurring point of friction where you're constantly like opening links from Slack and a new tab and then trying to come back to them later and read them. [33:53] Or, you know, you would be constantly doing this, like, let's zoom in on this image and figure out what it is. And is it something I need to worry about when you have those recurring tasks? [34:03] it's 100% worth it instead of spending the time on the task itself. [34:07] to spend the time never having to do that task again. And I like this idea of the payback period of personal software basically collapsing to zero because it really just illustrates where we are in terms of the efficiency and value out of AI, which is it is much more important than we are. [34:28] to learn to build some of these tools than to do a task right now. Like the payoff is so much higher to learn how to automate the task versus doing the task. And if you can just do the change your muscle memory,
[34:44] to every time you do the task, pause yourself and say, actually, I'm going to learn how to automate the task. You can really, really create a lot of leverage in in your day to day life, even in your personal life. [34:57] Yeah, for sure. And we are just about done here. So I was checking back in on our little, uh, [35:01] to-do list, which it just finished. Um, [35:04] So it's doing some final steps here, but we're just about ready to actually load this in. [35:09] Right now it's just kind of analyzing to see, uh, [35:12] Yeah, perfect. [35:13] So it's running this one last step, which is going to be. [35:16] It's basically telling us, hey, go load this in. [35:19] So once we are [35:21] done with this, we need to actually load it into Chrome. [35:26] So Chrome is the mode for extensions called Developer Mode. You'll see I have that toggle on at the top. [35:31] And it basically means that you can install extensions not from a Chrome Web Store. You can install extensions like from your local computer. So you don't want to generally have that on because like somebody could have side loaded in some some credit card skimmer that you've imported or something. But if you know what you're doing and you know, you just built a thing. [35:49] You can go in here and turn this on. And then this looks a little bit different once you have this on compared to probably you guys who are not having this on look in your Chrome extensions world. [35:58] because we have these options on the side here to load an unpacked extension. [36:02] So that means an extension that's not like fully deployed in the App Store. [36:06] So let's hit this. [36:08] And this is going to open up our little file browser here. So let's just pop back. [36:12] And we had called this
[36:15] Slack summary extension. Okay, so that is now loaded in the bone of truth. With the two of the software is you can only test it, or easily test it in Chrome. So we're going to actually try it out. [36:26] Whenever you download a new extension, you do need to refresh your tab. So it picks up that extension. So if right now I tried to use it, it's still working with the extensions that I had at the time I loaded it. [36:37] I'm going to refresh this Slack. [36:39] So I can pick up our new extension here. [36:43] And our moment of truth is going to be we're going to focus on this message. [36:46] And I'm going to hit my shortcut of control shift one and we'll see. Did we did we nail it? Oh, look, this is our link. Black color, right? [36:54] Yeah, kind of interesting. [36:56] So it's processing our link. We'll see. Did it work? [36:59] So it kind of worked. [37:01] But we have JSON here, right? It's not perfect. So let's work on one level of refinement here. [37:08] So I'm going to take a quick snippet shortcut of [37:12] Uh, miss? [37:13] So hang on, take a screenshot of this. [37:15] And we're going to send this back to Cloud Code and say, [37:20] Almost. [37:21] So again, it'd be cool if we one shot it. [37:24] I'm having a really cool demo, but it's not a perfect one shot. So we'll make one slight tweak here. [37:31] And I'm actually use a custom slash command I wrote to deal with screenshots. [37:36] So because I'm developing on Windows, but I actually run cloud code in this thing called the Windows subsystem for Linux, it doesn't have access to my Windows clipboard.
[37:46] So I can't do what everyone else can do, which is just like hit control V here and paste it. [37:50] So I added a slash command here called paste image. [37:53] that uses some PowerShell shortcuts to pull images out of my clipboard and share them with Claude. [38:00] And so I can take that snippet and share it. And again, this is similarly I would like would copy a file and I'd like save it in Windows and move it to Linux and then import it with an app mention. I was like, there's got to be a better rate. [38:12] And I use this slash command now all the time for building stuff out. [38:15] This is this is extreme software engineer stuff where you're like, OK, I run on this OS, but I run my terminal on this. And now I can't access my clipboard, but I still like it that way. So I'm going to write a little script to give myself a two word shortcut to make this happen. And so it ported this and it just it drops screenshots in our in my TMP directory. So I just have to say, yep, please read it. [38:41] So, again, it's saving those... [38:45] Those two minutes every day adds up adds up fast. [38:49] And so it's just going to fix this little JSON piece here. [38:52] And again, it was kind of, she was close. It, it, it got the right content, just didn't display it. Right. And so now it's going to go off and work on this for another second here. And we should hopefully have a quick update. [39:00] And the nice thing is with with Chrome extensions in the developer mode, [39:04] There's just an easy one click button where you will update and grab the latest copy of all of your extensions. [39:09] So if that's your work on multiple at a time or whatever, you can just hit that button and it's going to update for us. So we'll see it's finishing up here. [39:15] And one of the things that it's doing, just I'm calling this out for people who are writing,
[39:19] queries to open AI. It's moving the JSON out of the prompt, which it's saying, please return JSON. I think it's actually just returning JSON as a [39:28] string in text and it actually moved that to change the response object to being JSON. So then it can actually be read by the [39:38] by the Chrome extension in a more structured way. [39:41] Control shift one. [39:43] The end of works. [39:45] We're doing drum roll. [39:47] Beautiful. We got it. Nailed it. [39:51] So again, we've got these takeaways, we can now action on this and I can decide, do I actually care that iterable added MCP? I do. [39:57] uh spoiler alert um but yeah again we did it uh [40:01] I think under 25 minutes here. [40:03] And is it working in your screen reader? Did the accessibility pieces go through? Yeah, fully accessible. [40:09] So this modals in general can be sometimes problematic, um, because a screen reader will sometimes read behind the modal. Um, but. [40:17] Surprisingly, although not a bunch of the web is not accessible, if you tell some of the foundational models, like, please make this accessible. [40:24] The accessibility standards are actually incredibly well documented. [40:28] So they, they do actually a great job at this. So they, they use the right, uh, it's called Aria, A-R-I-A. [40:33] roles and make this modal have the right focus, not letting you read behind it. [40:38] So, uh, on the box, they're not going to make everything extremely accessible, but you say, Hey, go do this. It'll gladly go follow the spec and make it accessible. [40:44] So this is a meta question and maybe before I get into the meta question, let's just recap for folks what we saw. So you built a Chrome extension that's focused to the web version of Slack.
[40:57] that Chrome extension that's running locally because you've toggled on developer mode in your Chrome settings [41:04] We'll take a focused link that's shared by a colleague or somebody in Slack. It will go out. It will parse that link. We'll see if there are any links in it. [41:12] I'll parse it, it'll take some key takeaways. The way you built this is you bopped into VS Code, you dictated a short PRD, [41:20] You let AI kind of build that out. You made minor tweaks to it, but basically shipped it. [41:26] You used Claude code, including some custom slash commands, [41:31] and a Claude skill specifically around building [41:35] Chrome extensions. [41:36] to then scaffold out that Chrome extension. You showed us Control G in Cloud Code, where you can actually just modify [41:44] prompts and inputs as code, which is much more efficient, both from an accessibility perspective and just general user experience perspective. And you showed us a custom screenshot. So you're very special, as I say, [41:58] you know, unique snowflake software engineer environment can operate as if you as you want, even if there are some technical hurdles. And then now we have this great little extension that I want running on on my app. So this is great, Joe, I love this. I want to hop into some lightning round questions. And this has given me an idea of one that I really want to ask you, which is about [42:19] MCPs. So one of the things that I think is so interesting about MCPs is [42:25] is it allows you to bypass all...
[42:29] UI. [42:30] and just get to the bones of what a SaaS product does. And I can imagine that [42:37] While there are lots of okay accessibles, enterprise software products, not all of them are building for maximum accessibility, either in their design or in their kind of underlying way they're implemented. Have you found MCPs and just that interface into some of these SaaS tools has improved accessibility for you, has not? What are your thoughts there? [42:59] I think the altered goal of mine is [43:03] I would love to do everything in one place, not have to switch tools, like whether the context switch cost or just a. [43:09] A switch costs. So having MCPs has been great for that. Um, [43:12] Luckily, I actually think lots of enterprise software surprisingly is being built pretty successfully. Like I want to really give strong kudos to like Google Docs, Google Docs for what it does. [43:21] It's so crazy accessible and the work that people don't know that goes into make it that like every single thing that is being done. [43:28] is being communicated to the screen reader basically letter by letter, um, via this, like, secreting, uh, the secret system that people don't know about called like Aria live announcements. [43:37] is kind of crazy. [43:38] Um, but I do find like, Hey, I need to get something from three sites. Like that's kind of painful. Like, can I just use the notion MCP and cool docs, MCP or Glean in there is great. [43:48] Notion is one that is a little bit harder. They do their best from an accessibility standpoint in some ways. But there's a lot going on in an Notion post, an article. So I think that's another example where it's like, [43:59] Yeah, I can just pull this down and work in the Markdown version. That's going to be a lot easier. Um, and again, I've got.
[44:04] It's ways you're going to navigate with these like keyboard shortcuts and the folding features. So I will pull down some notion posts and just be like, dump this into a markdown file for me and then use my little code shortcuts to help navigate some of those pieces. [44:16] Yeah, I love that. And so... [44:18] My second question, again, is around personal software and the ability to translate. Sort of as you said, you can take a pretty complex Notion page. [44:28] turn it into Markdown. That allows you to, you know, read and parse it in a much more efficient way. And so, [44:35] I think this ability to translate files or formats is [44:40] a really exciting part of AI. And we've hinted at a couple things we've seen in this episode, a little bit of like image to text, [44:47] a little bit of voice to text. [44:50] But I'm curious for you, what are you most excited about in sort of like the multimodal world of AI? And, you know, what recently has come out that's, you know, caught your eye and made you excited? Or what are you hoping to see in the next couple months or years that you think could really open up stuff, either for you personally or just as a product builder? [45:08] Yeah, I'll talk in the personal space. So I have, uh, I have two kids, I have a five-year-old and a three-year-old and reading books to them is, is a challenge. I don't know Braille. I've. [45:17] I've been trying to learn, but it's a hard skill to pick up at 33. [45:21] And so I've memorized a handful of books, um, and I'll read those, but it's not really reading. It's fake, fake reading. [45:27] Um, but I, uh, big shout out to the Gemini app and it's, uh, like live share features. [45:33] I can now read any book. It sounds like me reading it, but me and my three-year-old Cole will sit on the couch. He'll bring a book over.
[45:39] And I'll be like, Hey, let's I can read this one or no Gemini can read this one for us and we'll turn the pages and say like. [45:45] Gemini next page and Gemini will read that page and then we'll turn to the next one. And it'll read the content of that. [45:50] And so I think just like equitable access to everything, um, is, is great. And that piece is one thing that I was always afraid of. I was like, can I read stories? I can memorize stories. I can tell stories, but there is something to just like your son being like, I want to read this book and you having to be like, sorry, I can't. [46:05] And now that's how I can't become like [46:07] "Sorry, I can" with the assistance of [46:10] so many different tools now, but I think the Gemini one is particularly useful [46:14] And I found that one to be the strongest for just like [46:17] Easy sharing, just saying next page, it knows all the context, it immediately starts reading it. [46:21] I have like meta glasses. I have the chat to be team pro. I've got all these things, but I think Gemini is doing the best job of it right now. [46:27] And this is before me trying any of the Gemini Pro three that just came out to see if any of that makes it even better. [46:33] Well, that is a very, very sweet story. And yes, I was just thinking, I wish the meta glasses, which I... [46:40] love and use every day would also help do a better job there. But it's awesome to hear that Gemini can add to that special time with kids, which is [46:51] As you and I were talking before the show, you know, we both are boys, parents of boys is just is such a special time. So my last question is when AI is not listening, and I'm curious if you type or if you speak this. [47:04] What are [47:05] Your prompting. [47:07] techniques have you ever whisper flow yelled
[47:10] at your AI or, you know, do you have any tricks for us when AI or Claude gets really, really stuck? [47:18] It's kind of, I think, a nerdy answer, which makes sense for me. But my typical mode is basically like, [47:24] clear the context and start fresh as much as possible. I think [47:29] I think a lot of people, uh, will, will try and like keep massaging it and being like, if I just send this one extra prompt in this conversation, it'll figure it out. It's like, no, you just have to start from scratch and take the learnings that you have from the last time. [47:42] And so sometimes I'll be like, this hasn't been going great. What did you what did you learn about this? Take that and feed that into the next prompt. But most of I used to be like, let's start from scratch, something clearly got poisoned in this context. And we start from scratch. I feel like it just everything just feels smoother. [47:56] I love it. Well, Jo, thank you so much for showing this. I think it's just [48:01] one of those workflows that we haven't seen before everybody can find a use case i am thinking of all sorts of little micro frictions in my own life [48:09] where a keyboard shortcut or two could really make things a little bit better for me. [48:14] Where can we find you and how can we be helpful to you? Yeah. So I mentioned at BabyList, BabyList is very actively hiring. And if you are somebody who likes using AI in your day to day building of software and you're [48:27] software engineer. Uh, we are Ruby on rails and react shop. Um, [48:32] but hiring across the board, all different levels. So check us out on babyliss.com. [48:37] And personally, I'm on LinkedIn as well. So feel free, especially if I'm the accessibility questions or any questions on some of the Chrome extension piece, I'm always happy to answer on LinkedIn.
[48:45] Well, thanks for joining How I AI. [48:47] Thank you. [48:57] You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. [49:14] See you next time.
Want to learn more?
Ask about this episode