Nicholas

How Block’s custom AI agent supercharges every team, from sales to data to engineering | Jackie Brosamer & Brad Axen

Nicholas

VP of engineering Jackie Brosamer and principal engineer Brad Axen join me to demo Goose, Block’s open-source AI agent that runs locally, plugs into your existing tools through model context protocol (MCP) servers, and peels away the rote parts of work so people can focus on insight and impact. This episode is packed with in-depth demos: starting with a messy farm-stand sales CSV, Goose analyzes the data, builds visualizations, and generates a shareable HTML report. We then spin up an MCP that lets Goose talk to Square’s dashboard for inventory management, vibe code an email MCP that can send payment links automatically, and unpack how environment setup, debugging, and tool orchestration get handled behind the scenes. What you’ll learn: A practical, repeatable workflow for turning any working script or function into a custom MCP—and exposing it to natural-language control How to transform messy CSVs into visualizations, HTML reports, and actionable business insights without needing a data science background Ways to hook Goose into live business systems (e.g. Square inventory, payments) so analysis flows directly into operational action The thinking behind Block’s decision to open-source Goose Lessons from Block’s bottom-up meets top-down adoption model Why organizational transformation, not just picking the right LLM, will separate AI winners from laggards over the next few years How to scale an internal MCP catalog The organizational transformation required to fully leverage AI capabilities — Brought to you by: CodeRabbit —Cut code review time and bugs in half. Instantly.

Published
Published Jul 28, 2025
Uploaded
Uploaded Jun 13, 2026
File type
POD
Queried
0

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:13

[00:00] Goose is our AI agent. We kind of designed it to be really agnostic. You tell it what you needed to do by connecting it to different capabilities, and it can just solve any problem. What I'm going to do is take that CSV that Jackie was working with, I'm just going to pop it over, and I'm going to say, can you read through this? [00:16] data, use it to create items in my square dashboard. So we took a CSV that we didn't even have to look at that probably had data that was not in this exact format and you created a product catalog. What a lot of my work and what my team is trying to do is not just make it faster for someone like me or on my team who has data expertise to go through this but to allow our finance team, our sales team, anyone in the company to be able to dig in and self-serve a lot of this data rather [00:46] ask an expert and wait for that to come back. You have been the first person that's told me it was the salespeople that were begging for it. We hear so much, you know, engineers are leading in and they want to vibe code and they want to do all that. But it's nice to hear that the folks close to customers and close to revenue are also seeing the value. Welcome back to How I AI. I'm Clara Vo, product leader and AI obsessive here on a mission to help you build better with these new tools.

1:16-2:56

[01:16] block. [01:16] who are going to show us how their open source AI agent, Goose, can be used to do everything from vibe data analysis to vibe coding and MCP. [01:25] Let's get to it. [01:27] This episode is brought to you by CodeRabbit, the AI code review platform, transforming how engineering teams ship faster with AI without sacrificing code quality. Quality code reviews are critical but time consuming. CodeRabbit acts as your AI co-pilot, providing instant code review comments and potential impacts of every pull request, beyond just flagging issues. CodeRabbit provides one-click fix suggestions and lets you define custom code quality rules using AST [01:57] patterns, catching subtle issues that traditional static analysis tools might miss. [02:02] CodeRabbit brings AI-powered code reviews directly into VS Code, Cursor, and Windsor. CodeRabbit has so far reviewed more than 10 million PRs, been installed on 1 million repositories, and has been used by 70,000 open source projects. Get CodeRabbit free for an entire year at coderabbit.ai and use the code HOWIAI. [02:28] Thanks for being here. Thanks for having us. [02:30] Yeah, excited to chat. I am so excited to have you two on How I AI because I have been so impressed how Block has, as a full organization and a large organization at that, really embraced AI, it seems, [02:46] everywhere and for everything. And, you know, everybody's worried about AI native startups, but you're one of the companies that I really think is going there very fast as a much larger

2:57-4:33

[02:57] organization. So how did you lean in as an org so fast and so broad? I think it's a combination of both like bottoms up and tops down. We saw, you know, a lot of concentrated pockets once ChatGPT came out that were really finding a lot of value out of these tools. For example, salespeople, they're like, any tool, give it to me, like, I'll use it, help me get those leads. And we also saw a lot of engineers, you know, starting to lean in as these tools became really good at coding. And kind of as that started from the bottom, [03:27] saw a lot of promise and they've been, you know, trying to make some of those practices a little bit more consistent among groups. [03:34] I think that [03:35] It's really a great time to focus on organizational transformation, because to me, that's really what's going to define who ends up the winners in whatever this AI future is, is who not just leans into technology, but even more than that leans into the organizational transformation, which can a lot of ways be even harder. It's like really easy to get technology to go exponentially and humans. [03:58] don't go exponentially so well. You have been the first person that's told me it was the sales people. It was the sales people that were we're begging for it. We hear so much, you know, engineers are leading in and they want to vibe code. They want to do all that. But it's nice to hear that the folks close to customers and close to revenue are also seeing the value. [04:16] Yeah, I think it's, you know, everyone finds like that different first use case that really gets them hooked. And one of the things I've been really impressed with, with some of our [04:25] non-developers especially, like we all kind of know how people use it for Vibe Coding, I think that's like well discussed, is non-developers are just so creative in

4:33-6:05

[04:33] So... [04:34] tolerant of like stringing together a bunch of different tools just in a way that you never would have imagined. And that's, I think, really going to be the power is getting these tools closer to the people that actually understand the problems and see what wacky things they come up with, with this new technology. [04:49] Not only are you all leading the way in terms of how organizations and I totally agree with you, I think the biggest transformation is [04:56] cultural and operational. The tech piece is like a lagging piece of it. You can figure out. But what I also love about what you're doing is not just how you're embracing it internally, but how you're trying to drive and help others adopt it externally. And so I'm wondering if you tell us a little bit about the adorably named Goose and what Goose is and why you all build it. [05:18] So Goose is our AI agent and [05:21] There's a lot of people have a lot of different definitions for what an AI agent even is now. But I'll say specifically what we mean by that is it's powered by an LLM, but it has a whole collection of tools that it's going to go use to solve problems for you. [05:34] And [05:35] This is something we're really excited about, in part just because it lets us build for ourselves and come up with workflows that work for our company. But like you said, we open sourced it because we think there's a lot of extensibility for everybody. [05:48] And we kind of designed it to be [05:50] really agnostic. You tell it what you need it to do by connecting it to different capabilities, and it can just kind of solve any problem. And I think we'll get to show some of that today. [06:00] And that's all powered by MCP. So that's a big part of this like open source

6:05-7:35

[06:05] uh, [06:06] puzzle is how do we make this work? [06:09] no matter what you wanted to do. And MCP really helped us get in like early into helping to establish the protocol and have a way for us to connect it to anything. And people can develop their own MCPs and just make things work. [06:21] Got it. So you and I just want to show here it's up on GitHub. It's open source. It's your agent, your agent framework. [06:29] powered a lot, it seems, by MCPs, which I think we're going to talk about later, that anybody can use and extend upon. [06:38] And what made you all think about open sourcing this? [06:40] Yeah, open source is, I think, a really big [06:43] emphasis just as a value for our company in [06:46] I think there's two ways we think of it. Number one, this is just like the right thing to do, you know, [06:51] for the world in some way. But then there's also we think that there's so much benefit in terms of having this open ecosystem and really seeing these patterns emerge from not just, you know, the engineers that we have in our company, but from engineers all over the world. And I think we're really seeing that come to fruition with a lot of the growth that we've seen in MCP, [07:08] you know, and all the MCP servers that are out there. And it's only been, you know, what, like six, [07:13] eight months. And then I think we also wanted to really [07:18] lean into open source in order to take advantage of all these different models that are coming out. A lot of the tools that come bundled, you know, released by these different model providers can only use like that kind of models. And we know that every month a new model is coming out. One might be better at coding, one might be better at writing. And

7:35-9:06

[07:35] We think there's a huge amount of power in being able to bring together [07:39] data across all these different MCPs and then also pair that with [07:44] really any model that you want. Okay, so speaking of data, I think you're going to show us how to do [07:52] Vibe Data Analysis, which is a first for How I AI. So it's a flow that you use internally. So this is definitely something we're not making it up. You use this exact flow day to day, but we're gonna use some fake data so we can screen share and really get into how it all works. And so Jackie, what kind of business are we today that we're going to do some data analysis with? Yeah. So I tried to make this as realistic as possible when I don't work in tech. I actually have a small farm. [08:18] that is not very profitable. And so we're going to be doing some analysis [08:23] that I've actually done with my own internal data that kind of shows us [08:27] how we can look at some business data [08:30] and make some recommendations. So what we're going to do is go ahead and look at this to try to find this local CSV of data that has July data for my farm stand, which does a bunch of-- [08:48] vegetables and what we've asked is we've asked Goose to use Pandas, which is a Python library, to tell me the items have the best revenue and like what the busiest days of the weeks are. And then I left a general kind of question at the end. [09:02] Just what are some trends because I think it's always interesting when you get a slightly different result.

9:07-10:40

[09:07] as you run this. And so you can see the first thing that we're doing here is it's using ripgrab to find [09:12] the CSV, [09:13] We found that riprep is actually really effective, even though it's really simple. There's a lot you can get done with it versus something fancier like a semantic search. It identifies this July data at CSV. Then it writes a Python. [09:28] command, it found some error where it didn't have this installed. Oh, no, this is maybe going into cursed virtual environment territory, but it looks like it's going to solve itself. Sometimes I'll tell it like check your Python environment so that it makes sure it sources it, but it decided to go ahead and create a new virtual environment here. One of the things I really like about Goose is I'm terrible at maintaining developer environments. And so we'll go through and try to just like debug that for me. [09:55] I mean, how many of us, maybe this is a very niche comment, have been victimized by trying to get the right version of Pandas installed? [10:02] Because I definitely have. OK, and just taking a step back, we didn't see the input data, but is this just like general day by day sales of different products? Exactly. This was just a list of item, quantity, date, price. And so from that, it's able to go through and say, OK, your top revenue generating items are berries. We love berries, very high margins. And it's got the best days are like Thursdays and-- [10:31] And then I just had this bonus question of like, what are some key items and trends? It'll kind of lead into some stuff later. So cherries have the highest average value.

10:40-12:13

[10:40] It goes through, you know, what are your mid high low items? [10:43] And then, you know, [10:44] goes through some weekly patterns. And so without even asking, of course, it comes out with some recommendations, which I really love. Let's just pause on these recommendations. So what I love about this is you have this ugly file, you probably could have tossed in a spreadsheet or done some of this analysis yourself. And you're you're doing sums and you're doing pivot tables. And you're trying to explore this. And you probably have to do different tabs for all these different analyses. And this you can really drop in. And it's maybe a relatively if you scroll up, it's, you know, [11:14] but under the hood you have this pretty powerful data analysis package and so i'm presuming can go from this level [11:22] of analysis to something much, much deeper. Yeah, that's right. And I when I'm doing something like this live with something like you, what I'll usually do is kind of you have it take a high level first pass and then you know, dive into specific sections and say like, Hey, can we go [11:36] you know, use this particular type of method, you know, do a line regression that tries to figure out price elasticity or something like that. [11:42] Well, and I have to say, as a mom of young boys, I do feel like I spend, you know, $1,500 on strawberries every month. So this is really hitting home. Okay, so we can go down and then I'd love to see [11:54] I like the idea of when you're prompting a data analysis to say, give me the data analysis, but also give me some recommendations. And so it's really saying here... [12:04] you know, high margin items, which is great. Look at why Friday drops because, you know, maybe they would expect that to be a higher day.

12:13-13:55

[12:13] weekend sales, and then some price adjustments. That's really... [12:17] Interesting. So what would you do next after this? Yeah, so I don't think that this is like very good to share. And so a lot of times what we see is that people really like to share these kinds of analyses as local webpages, especially people who aren't technical or just a whole new world is opened up with them to them as they figure out how to use AI to turn some of the stuff into HTML and share it with each other. [12:47] users that want to kind of create their own dashboards for, you know, whatever. While this is generating, I'm just reflecting on what I've seen so far. So it seems like Goose has sort of a virtual machine in which it can write code and wrestle with Python dependencies. Love that for Goose. It has the ability to spin up hosted HTML and websites. Is that right? Is that what it's kind of doing here? So it's all running locally. I think that's one thing that's like [13:17] a lot of our developer environment stuff happens locally. And so that was like a really natural fit. And then we also saw within the open source community that this has actually been really popular with kind of the crowd that really wants to maintain [13:29] end-to-end control over their workflows. [13:33] One of the things we've seen with Goose is that some of the most promisey use cases are, you know, you tell Goose to do something, you kind of go away and... [13:42] wait for that task to complete as it does things like write code errors. Oh, you're convinced. It's also cool because it went ahead and popped this up for me. This is not the prettiest website it's ever made.

13:55-15:19

[13:55] with this fake data, but you can see it's able to make these bar charts with plotly. [14:00] And I could go ahead and dig in and try to refine this if I wanted it to be, you know, a little bit more well designed. Got it. And, you know, if I was being... [14:10] cheap and easy about sharing it, I would just print as a PDF and shoot it over and it would look nice. [14:15] We run this internally. We have an MCP that it connects to where you can just publish it to the company. Like kind of keep it internally private, but share it with anybody. [14:24] And what are the MCPs sort of out of the box that if I cloned the Goose repo right away, would it have ready to go? And then what are some kind of ones that you've either built or integrated specific to maybe what you do at Block? [14:40] Yeah, so we have like a collection of different MCPs. [14:45] Some of them we've taken from the open source community. Some of them we've developed internally because, you know, for example, we have an internal data MCP that I use recently. [14:54] every day that helps sort through where's the revenue for X product. I mean, it knows all that context. And we're also working on trying to have external MCPs that connect to our business products. A lot of what people are using use for and a lot where we saw the first value is just MCPs that connect to all of our key internal systems. So Google Drive, probably the one I use the most. Brad mentioned this one we have internally called BlockCell that's really popular

15:24-16:55

[15:24] anyone in the company. We also have a developer extension that does a lot of this command line and tool stuff. [15:30] We also have a computer controller one that just did. And that's how it just opened this Chrome window. Got it. Cool. That's the stuff that bundles with Goose. So when you get it, it's like you're going to have like five really simple like memory developer. Yep. And then when we use it internally, we have like 40 or 50. [15:48] Yep. [15:49] And Jackie, how often are you doing an analysis like this? [15:54] in your week using Goose? If I'm doing an analysis, I'm using Goose. The question is more like, how many analyses do I do per week? I would say that I probably actually use it more in my day-to-day life. [16:07] for writing, but there's people in my team that, you know, are professional analysts or professional data scientists that are doing this, you know, every day. And I think increasingly what a lot of my work and what my team is trying to do is not just make it faster for people. [16:22] someone like me or on my team who has data expertise to go through this, but to allow [16:27] our finance team, our sales team, you know, anyone in the company to be able to dig in and like self-serve a lot of this data rather than having to ask an expert and wait for that to come back. I think we've all been talking about like the age of self-serve data for a long time. And it's like, [16:42] kind of gotten better, but we all know that [16:45] you usually still go ask a data scientist. And so I think it's really interesting to think about how can we [16:50] you know, kind of short circuit that and allow everybody to focus less on kind of

16:55-18:38

[16:55] the rote parts of data analysis and spend more time talking about, you know, the insights and the context and the creative parts that we all want to spend our days on. [17:05] Okay, so we've talked about the salespeople, love their AI, the farmers. [17:09] love their AI, but can we please get back to what people can wrap their head around, which is the software engineers. So you have this you have this great. [17:18] Great data. [17:19] And you want to do something with it. And so you have this data, you know, maybe you want to do something with it to improve your business. Brad, what's the next step in your mind here? [17:31] Yeah, let me share some... [17:33] on my screen for this, and we can show a little bit about like [17:37] how a developer can build [17:40] MCPs? [17:41] And use that to actually like connect to what someone is doing into like existing products or to like solve new problems. And this is really how I spend my day to day. Like I feel like at this moment in time, I'm just developing MCPs left and right. And I'm going to just pause right there because no matter how many times people hear it, they still need an MCP explained to them. So Brad, we're going to take your explanation of what is an MCP. [18:04] Let me do that by showing you the Patriot and Goose here. [18:08] In this, we have just this huge collection of extensions in Goose, and these are all [18:15] implemented with model context vertical. So each of these is an MCP server. And what that means is that it has a collection of tools [18:23] that [18:25] it exposes to the agent and it can do a couple of other things like provide data resources and all that other stuff. And so it's like a protocol that lets the agent talk to a third party and get things that it can do and data it can read.

18:39-20:18

[18:39] And most importantly, it's kind of the arms and legs for the model. So [18:43] This is how the model goes and interacts with the real world and makes changes in online systems. [18:48] Okay, so you're spending... [18:50] all your time building MCPs. It seems like everybody is spending all their time building MCPs. So why don't you walk us through how you're spending your your days? [18:59] Yeah, and so let me start by showing you an existing one before we start building. And so I've turned on a real Square MTP server. So this is like from this is available, like as an online MTP server that people can use to like run their business. So this is my Square dashboard, and it's totally empty right now. [19:18] This is kind of an account I set up for the sake of the demo. [19:21] And [19:22] We're going to connect that through our agent and actually have it fill things in. [19:27] So... [19:28] What I'm going to do is take that CSV that Jackie was working with, I'm just going to pop it over [19:33] And I'm going to say, can you read through this data and use it to create items and vice where dashboard. And so. [19:47] This is something that you could do with all kinds of input formats. Like, for example, I could give it an image and like maybe a picture of a menu or something like that. And so since it's multimodal, it's just going to go figure out that source. [19:59] translated into the API calls that it needs. [20:02] And I'm thinking as a product builder who has built many import your data products, where you're always trying to say like, it has to be a CSV in this format, in these columns, with this column name. And what I've just realized this opens up for you is like,

20:18-21:50

[20:18] Drop in your arbitrary, messy, whatever data. Don't have to reformat it. Let the MCP figure out the translation. Exactly. And then get it into the system. Okay. Yeah. So it doesn't matter what you've got. It's going to figure out what could be a PDF, whatever. It's going to do its best to like figure out how to make that work. [20:35] you know, the first thing it does is it just reads it. [20:37] Yeah. [20:39] It's then going to go... This is actually a really interesting part. I think the Square team did a great job designing this NCP. [20:45] Because it has... [20:47] so many different things that you can do on Square that it's like too much for the model to handle all at once. [20:53] So what it actually does under the hood here is it says like, "Oh, if you need to know about catalog, I give you a method to look up what you can do in catalog." And then under the hood, the LOM is seeing. [21:05] like all of these operations that it can now access. [21:08] So all that stuff about like creating items that figured out. [21:11] Okay, and I actually see... [21:14] The, it's like, this is a funny, I normally would just read the CSV. [21:18] um so i'm like i'm like laughing a little bit at like how much work it's going through for this but uh it is what it is i know i have i have that same experience using like a cursor max mode where i'm like i just don't know if you need to work to think that hard exactly how to do this but you know it's not my brain it's the ai's brain yeah yeah so we'll we'll see i'll let it cook for a minute here um and see how it figures this out but effectively what's happening is it's just deciding to use [21:47] shell commands to grab the prices from that CSV.

21:51-23:23

[21:51] and [21:51] Then it's going to turn that into an API request and actually send it to [21:56] the square [21:57] Dashboard. [21:58] and create a product category and then fill in the items. [22:02] So we should just be one request away from getting that solved? [22:05] Got it. And instead of sitting here and waiting, Jackie, I'm presuming [22:09] Farmer Jackie would have to go in here and one by one... [22:13] sort of enter this data in and make sure it's right and all that kind of stuff. One of the really underappreciated... [22:22] things about LLMs is how much they function is data duct tape. [22:26] It's like it doesn't quite fit. You can do something to just kind of [22:30] make it all mushed together. And that's where a lot of the day-to-day data work is, is trying to match all those things. How I AI is now on Lenny's list with my personal selection of the best AI engineering courses on Maven. You can spend months thinking and playing with AI before really integrating it into your workflow or shipping an actual AI feature. If you want to [23:00] Neresh Riganti, MIT instructor and AI scientist at AWS, or Sander Shuloff, who has authored research with OpenAI, Hugging Face, and Stanford. To pivot into an AI role or successfully lead your company's next AI initiative, visit maven.com slash Lenny to enroll now. Use code

23:30-25:09

[23:30] slash Lenny to get ahead in the AI era and start building. [23:35] Okay, look at that. So we took a CSV that we didn't even have to look at, that probably had data that was not in this exact format, and you created a product catalog. [23:47] out of it. [23:48] And this is something like I could follow up with this and say like, okay, grab images or even generate images and like, I was gonna ask. Yeah, I want to I want to AI generated Apple image. Exactly. Yeah. And the I'll skip it for the sake of time. But like, there's so much you can do to just like navigate through all of this. And [24:07] This is something where we're watching it live for the sake of the demo, but really the best version of this is like, hand it to CSB and walk away and come back and then you're going to just see that this is done. [24:18] And so that kind of like background task thing is the I think the best mode for Goose in a lot of ways. [24:23] But yeah, it like goes and figures this out. And this is like a real account. So what's cool about this is that this isn't just like a little website showing that I have prices for apples. It's something I can take payments with. So I can now say, okay, let's make a payment link for three pumpkins. [24:41] Thank you. [24:41] And we'll let that run for a little bit, too. This is not the topic I thought I was going to get from you all today. You know, I had a lot of things in mind, but using AI to generate farm stand [24:54] Payment links for gourds was just not on the top of my list this afternoon, but I'm excited to see it. Okay, so I think, you know, one of the other things that I think, and I know this is running locally and we won't get into like sort of the local...

25:09-26:46

[25:09] mobile debate right now, but [25:12] I just think the ability to do this stuff via text on the go is so powerful. Imagine somebody wants to buy the three pumpkins off the back of your truck and you want to be able to do this really easily. This sort of like ability to just type in a short command to get a really custom software solution backed by, you know, either a platform or not, I think is pretty cool for people. [25:35] I think there's like so many little details too that again, like while you're watching it, it's a little slow, but, but it just figures this out for you without any intervention. Like, Oh, it's, [25:44] it has to decide which location I'm selling these from for the payment link. And it's just, it just like saw that it needed it. And like when it looked up what locations I've got here, [25:54] And it just unblocks itself, which is really nice. So you can just wait for this to go. And so, OK, so I actually have a real payment. [26:02] And this is just going to load up and it uses my items. [26:05] has a price. Look at that! Yeah. And so we could just throw in a real credit card right now and this would work and we would collect on that. [26:13] And so this is like, okay, so this is a bit of a tour of an existing MCP, like how you connect these agents to real products. But then I think what's like part of my job is really like, okay, what's the next step? And how do we make the agent work? [26:28] Do the next step. [26:29] And so for me, I think a thing we could switch to from here is like maybe we go give it another MCP so that it can send these payment links in an email. [26:39] And so I have a test email account here that's totally empty. And I think we can work with that and then just try to show you how you can literally

26:47-28:24

[26:47] extend the existing conversation. So I'm going to leave this open, and we'll come back to it. I'm going to make a new window that we're going to use to make [26:55] the like vibe coding half of this. So we'll just go vibe code and mcp together. I'm also going to open up a terminal window so that we can follow along. [27:03] And I did do one thing in advance, which is... [27:07] I went and grabbed [27:09] an API key here in [27:12] an env file, and I set that up just in advance so we didn't have to show the API key on the screen. [27:19] And so we're going to use that to... [27:22] set up an MCP server with Mailman. And I want to just pause here because we've heard this a lot on this podcast, which is people are always saying, [27:31] you know, you can you can do the output. I'm sure you could take this link and send the email. But any time you're about to create an output instead, think about how can I create a tool or a prompt or an input that could do this for me? So instead of saying, great, I'm going to email this link, you say, great. [27:44] I'm going to build an MCP to send emails. Exactly. Yeah. Like how do you take that whole end to end workflow and solve every part? Yeah. [27:51] Okay, so let's talk it into Vibe coding at MCP for us. So the way I like to do this usually is I'll start with the scaffolding and making sure that everything kind of connects and works. So I won't even say MCP at first, I'm going to just say like, let's try to send an email. So I'll tell it like I have a scaffolded hyphen package here using UB. I'll tell it about that .env file I mentioned. [28:16] And I'll tell it what's in there, because I don't want it to actually read this. It's like, you know, I don't want the model to know about my API.

28:24-29:55

[28:24] And so then I'll say, "Can you make a Python script in this directory that sends a test email?" And this is the vast majority of the work. It going from a blank slate to sending a test email. [28:39] That's the hard part. The NCP half is really easy. So [28:43] Yeah, I always start that way. Like, let's just prove that everything works. That you can do the thing and then we can figure out how to invoke the tool. [28:50] Yeah. [28:51] And this is actually another feature of Goose that I'm kind of glad it accidentally triggered is that it tried to read [28:56] the .env file because it was like, oh, what am I working with? And we denied that, actually, because it's like a security concern. So there's a bunch of stuff that we try to catch [29:07] And so it can't see it, but it's going to have to just trust me that what I said is in there and then use it. [29:13] And so, okay, let me follow along over here. So I'm going to just... [29:18] We have a new file now, send test email. [29:22] that should be using this. And it just wrote that and if I wanted to like follow along in detail, I could do I can I could read it in an editor or something like that. But for the most part, I'm just going to trust it and hope it works. [29:35] And so let's say, okay, can you, um, and I'm going to give it that demos email. Got it. So now it's written the script and you're just going to double check this thing works and that I'm going to get. [29:47] an email out of the script. [29:49] And [29:50] So it figured out what I meant by the dot end. So it's like, okay, I found a thing.

29:55-31:27

[29:55] And [29:57] It's got. [29:58] this hard-coded recipient that it's going to go edit. And you can see like if I like just read down there, it like just fix that for it now that I told it who it's going to send it to. [30:08] And... [30:09] It's figuring out all this virtual stuff. OK, so just AI just like us. [30:15] figuring out its virtual environment that is the theme of today's episode [30:21] Okay, and then let me go over here and see if we got it. And I'm sure it's going to end up in spam or something. Oh, there it is. And so we got a test. [30:31] email sent with Python. [30:34] And it goes to spam because obviously this is all demo stuff. [30:37] That's not surprising. And whenever I, you know, I get engineers asking me a lot, they're like, Mailgun, it's well documented. I can write the thing. But I think about the number of characters you wrote in Goose and just the number, like the typing that had to happen in the file. And I just think it has just saved you. [30:57] 15 minutes of time of like looking at the docs, copying it over, you know, checking that it works. And we did it in probably two minutes. Yeah, exactly. And like, I just I've I've never built an email sending service. So it's like, I don't know this API at all. It didn't have to ask me for that. It's like, oh, it knew the endpoint name and everything. That's all trained in, which is just fantastic. [31:17] Thank you. [31:18] Okay, so now that this is working, let's turn it into an MCP. And then this is an example of like a little bit of model knowledge going a long way. So this model,

31:28-33:05

[31:28] isn't trained recently enough to know about the MCP SDKs. So that's a thing where I'm going to actually go get it that context so that it can use it. And so I'll just go find like the... yeah, MCP Python SDK. [31:44] And we'll just grab it from... [31:46] the read me and so at this point I could actually tell goose the URL and like it could download this read me and it would work but I rather just like copy and paste honestly because it's easier. [31:56] I'm the same as you. I'm just like, let's just drag the whole page and paste it in. Yeah. [32:03] Yeah, and it's like, okay, great. Like, it's already got these links. All I need is just this one example, and that should be enough. [32:09] And so I'll say, you know, OK, be nice to your LLMs, of course. I agree. Be nice to your LLMs. So I'm going to say now we're going to make an MCP server out of this. [32:22] Here's reference code. And then I'll say, "Can you use that example to make an NCP with a send email tool?" And we'll see how far it gets with no other context, actually. We might need to tweak stuff, but hopefully that's like a good start for it. And while it's thinking, are you doing, does Goose auto select models, do you have a default? [32:48] Do you have a favorite? We currently don't auto select models because people bring so many different models to Goose. And so for us, [32:57] We kind of switch context by context like Jackie was mentioning. So for writing code and using these tools, currently I'm using Cloud Sonnet, which is great.

33:05-34:35

[33:05] But if I wanted to go write a design doc, I might actually switch to one of the Reasoner models from OpenAI. [33:14] um or i could even like try out like a local only model and run qn [33:19] Thank you. [33:19] Okay, so let's see what it's doing. I'll check in again. And I'm just gonna like take a look. Okay, so it's got a new file. And it's combining like what we were doing before. And... [33:31] It's following that example pretty well, it looks like. [33:34] And [33:35] That's actually good code. [33:37] I'm going to interrupt it, honestly, because I know what it's going. It's going down like a slightly wrong path because it doesn't know how I'm going to integrate it. So. [33:45] I'm just going to pause that. [33:47] I like the judgment. [33:50] Yeah. Good code. It's definitely the right idea. I'm not going to read line by line, because I kind of trust it at this point. But what I am going to do is I'll make my one line contribution to this project right now, which is I know I want it to be running not as a remote URL. I want it to just be a standard IOMCP if you work in it. That's the difference here. So I'm just going to make it standard IOMCP. [34:14] And then [34:16] I'm going to ask it a weird question to help me set this up in Goose. [34:21] So I'm going to say to help me do that, can you get the path? [34:27] in this virtual Lens. So really, I'm just trying to make sure that I have a portable command so that I can run and use. And so I'm going to ask it to figure out that virtual Lens.

34:35-36:07

[34:35] so some that and that should just take like one or two tool calls from it [34:41] And is this your just general flow for making MCPs all the time? Build me the function, then here's the code to turn it into an MCP. I'm going to make my one line contribution. I'm going to ask it a... [34:53] this very specific question that I know works in my system. And then you're going to get it, get it going. Yes, exactly. And so once you have that, like a core functionality working, then you give it this example and turn it into the MCP. Exactly that. [35:07] And so, um, [35:09] Okay, so it got me this, like, this is the path of the virtual length Python, which is what I needed. [35:15] And then I'll ask you, like, also, what's the abstract with the Picon file? [35:21] And [35:23] That's going to be the two ingredients that I need to go make this work. [35:26] And... [35:28] It's just running shell commands to do this. So I've already got it there. So let's do this. And I'm going to go into that existing [35:38] settings. [35:39] And I'm just going to immediately add a new custom extension that we just built. So I'm going to call this email. We're going to do that path to the Python file. Oops, and I didn't paste the other one. So let me just go to the terminal and grab this. [35:53] And there, and we're going to do, we're just going to run that server we just created. And that should be everything. [36:02] Oops, I got an error. [36:04] Let me see what that is. I'm just going to paste the output.

36:08-37:40

[36:08] Oh, so there's actually a code error. [36:11] Oh, well there you go. [36:13] And I'm going, it wasn't. [36:15] Good code. And so I'm going to tell it this... [36:20] I'm going to tell it this error and see if it can just fix it. [36:24] So I like that you actually say I got an error. Can you help me? Because I just go straight to just paste the error. Yeah, it's probably spart it. I like spend a little too much time like having a conversation, honestly, with the LOMs. And... [36:40] uh, [36:42] And so hopefully it can figure this out without me thinking too hard about what's in the file. We can watch it work, of course. [36:51] And... [36:52] Honestly, I think the thing that it did here is this resource. [36:55] So it's fixing that, but... [36:58] Yeah. Okay. Okay. So it figured it out at about the same time I did. [37:03] Okay, so this is going to be the new podcast. It's [37:06] human developer, AI developer, who figures it out, who spots it first? [37:12] I the from the debugging flow is like something that I thought was going to be what kept me employed with the AI writing all these code. But I'm like, oh, it's actually pretty good. It's pretty fast. [37:25] Yeah. [37:26] So we're going to try that again now that we've edited that file. [37:30] I'm going to interrupt it because again, I think it's doing some extra stuff we don't need. [37:33] And I'm just going to [37:36] toggle that [37:38] Where did it end up in the list? I'm going to toggle that on again.

37:41-39:14

[37:41] oops [37:42] Okay, new error. Sorry about that. Normally this works in one shot actually. [37:47] Do you find as an engineer that you have shifted your debugging strategy to... [37:55] This, which is just, I'd rather loop with the agent and just get it to figure it out than... [38:03] sit here and use my eyeballs and my fingers to [38:06] solve this thing [38:07] I'm just curious how that's happened internally to yourself. I think I almost always let the LLM try once. [38:15] Like if it if it knows what to do to fix it, I'm like, great, that's faster than me figuring out. And then the thing that I think I am better at than it is, is like just being tenacious. So if it's like if it's not figuring it out, it's like, okay, I will take over and do it. [38:30] So this version [38:32] is like I'll take a look at like whatever idea. [38:37] it's coming up with and [38:39] If it seems like it's in the right direction, I'll let it finish. [38:43] We did it! Hooray. [38:46] Great. So now that we've got the like infars loading and we actually, so what we just did is we like turned that thing we just built on in the existing chat. [38:56] okay so it's it's a ex it actually should now see it so i'm gonna play like great we just enabled that [39:03] Can you see the new tools? Try sending a destino. [39:09] And let's see if that works. [39:11] So now we've at least got it running. And yeah, it sees the tools, right? So...

39:15-40:46

[39:15] And it's trying this get email status, which we don't need that. [39:19] But it's like, great, it sees that everything is ready to go. [39:23] And then [39:24] It's like [39:25] Sending me an MCPE email test. [39:28] And it is success. And so... [39:31] It's working. And so we kind of built it, plugged it into Goose, and then immediately tested it out. And let me go check my inevitable spam folder. [39:40] Yep, there it is. [39:41] You did it! Great. So despite all the spam checks, [39:46] We've got that working. And then now we can kind of finish that off by bringing it back to where we started. Right. So we know that works. I'm going to go over to this existing chat. [39:55] And these are two separate conversations, right? So like this, this LLM knew all about everything we just did. [40:01] I'm going to close that out. [40:02] This one has no idea that just happened. So I'm just going to go in and toggle that on and off real quick. [40:09] And [40:10] We're going to go back here and I'm going to say, "I just enabled an email tool. Can you send that payment link to my demo email address?" [40:26] So it doesn't have any of that, like it doesn't have to its memory doesn't have the functions or any of that stuff. [40:32] So this is the real test. Yes. [40:35] So it has no idea other than that [40:37] This thing just got created. [40:39] And it's doing the same thing that the other one did, which is like, OK, it's like checking and knows where it's coming from.

40:47-42:17

[40:47] And yeah, hopefully we'll get it to send something. [40:49] At this point, it's like writing more content, right? Because it's like, I'm sure it's going to try to be cute about... [40:54] the body. Are we going to get a pumpkin emoji in the perfect line? Right, yeah, that's what I'm hoping for. [41:00] Okay. And we should have something in spam. Yep. And so we've got a store payment. [41:08] We've got our payment link, order details, all of that stuff. [41:12] And, you know, the same link should bring us back. [41:17] So there are two levels of faces I make on this podcast. This is one of them where I go, ah! And then this is the other one where something works. So you've gotten the raised hands up. [41:28] Podcast reaction. So, yeah. [41:31] Just to recap everything Goose did for us today. I'm like still still processing. [41:36] Jackie, we got your CSV. We took some dirty produce data, got some insights on how to make it better. [41:43] Brad, you took that and populated a store [41:48] with your produce items, then created a way for people to buy those items through a nice little friendly link. Then you vibe coded an MCP. [42:00] live, [42:01] thrilling as somebody who's avoided building an MCP for no less than six weeks. Now I feel like I have no no excuse other than to just go do it. [42:11] And then you use that... [42:13] MCP. [42:15] to send somebody... [42:16] a pumpkin...

42:17-43:59

[42:17] purchase, you know. That's right. [42:20] This is so great. Okay. This has been incredibly fun. I have to ask you both as part of our lightning round questions. [42:30] Jackie, what's your favorite MCP tool? [42:33] in your stack right now? I hate to say it, but it's probably Google Drive. That is a manager where I live, and I'm just so happy to be able to not have to copy and paste and click and save me a ton of time. OK, Brad, what's your favorite? Definitely Shell. [42:49] I just, you know, I've lost I've lost all my memory of the arguments to various shell commands. It's just 100% filling them in for me at this point. Okay. And then if you could make a pitch to people inside an organization that are hesitant about getting started with AI or even people that are outside of technology that just say, I don't know, how could this apply to me? Why would I use it? Seems really complicated. What would you what would you say to them? [43:16] find the thing that you don't like doing and automate that like don't don't if you love programming don't try to automate your programming that's like fun for you you maybe figure out when when you want to make that trade-off but it's like oh i hate writing the unit tests that's the thing i'm gonna have the ai do [43:30] Okay. Get rid of toil. Spoken like a true, true engineer. Jackie, what about you? I think really treating it as a learning experience and realizing that this is the worst the models are ever going to be. [43:41] And so even if it's not something that, [43:44] transforms your daily life right now, even by keeping up with the tools and seeing where it fails. Seeing where it fails is just as useful as seeing where it succeeds and still really training as an experiment and a way to learn versus a magic bullet that's going to magically solve everything.

43:59-45:13

[43:59] Okay. And then Brad, we saw how polite you are. We have to ask both of you our favorite question, which is when... [44:05] AI is not doing. [44:08] what you want. [44:09] And you're starting, maybe you get exasperated. I don't know. What's your tactic? You seem very polite. So every, I saw every line started with great, but you really screwed that up. So Brad, what's your, what's your strategy here? So I thought this was kind, but I realized now it's even more brutal is, is that like the moment it's not good. I just, I just throw out the session. I throw out the whole convo and I start over and it's like, I, cause I, like, it's so much work to convince it to do the thing you want. [44:39] And then I'm like, oh, I guess that like that thread with that model, it's just done forever now. So it's kind of sad. But can you imagine if we did this at work where somebody started to say something that was totally wrong? You just like shut the door in their face and said, come back and just start start totally, totally over. OK, Jackie, what about you? When it's not functioning right, definitely start a new session. But what I like to do is have it. [45:04] I'll give it like really keywords that I want if I want the style of something to change. So, you know, if something comes out like too MBA, like a lot of times they'll say write it like you're a hacker.

45:16-46:29

[45:16] how to like summarize everything that it did, the good parts and then like take out the bad parts and start a new session based on those instructions. Great. I have learned so much myself. I'm going to go... [45:27] Download Goose and build that MCP that I've been avoiding. Where can we find more information about this? Where can we find you two? And how can we be helpful? [45:35] Definitely check out block.github.io/goose. That's the homepage, all the install instructions, and it should lead you also to our GitHub repo where we love contributions. And if you want to chat with us about it, we run a Discord for Block Open Source. So get in there and all of us are in there and we debug live and all that kind of stuff. So a lot of activity for sure. [45:57] Amazing. Well, thank you so much for sharing all this on the show. [46:01] Thanks for having us. This was fun. Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. [46:11] You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. See you next time.

Want to learn more?

Ask about this episode