Nicholas

Gemini Omni: Clone yourself with AI in under 15 minutes

In this experimental episode, I document my real-time attempt to create an AI avatar of myself using Google Flow and the new Gemini Omni video generation model. I walk through the entire process, from scanning my face with my phone to generating a complete one-minute hype video for the podcast, all in about 15 minutes.

What you’ll learn:
How to create an AI avatar using Google Flow in under five minutes
Why video AI tools unlock creative possibilities for people with zero video production skills
The step-by-step process of generating a full storyboard using AI as your creative producer
How to use character consistency features to generate multiple video scenes with the same avatar
The uncanny-valley moments you’ll encounter when your AI clone doesn’t quite nail emotions or physics
How to stitch together AI-generated scenes into a complete video using built-in editing tools

Brought to you by:
Merge: Connective infrastructure for production AI
Jira Product Discovery: Prioritize with insights, build with confidence

In this episode, we cover:
(00:00) Getting started with Google Flow and Gemini Omni
(01:38) The avatar creation process: scanning and photo capture
(02:55) Using Flow to brainstorm a hype video storyboard
(06:59) Generating the first video scene with the avatar
(08:41) Troubleshooting: accidentally generating images instead of videos
(09:32) Generating all seven scenes for the complete video
(11:37) Reviewing the avatar videos
(13:13) Stitching the videos together in the browser-based editor
(14:32) The complete How I AI hype video

Published Jun 3, 2026
Uploaded Jun 12, 2026
File type: POD

Full transcript

AI-generated transcript with timestamped sections.

0:00-1:45

[00:00] Today I am doing a very strange episode where I'm going to create a video avatar of myself [00:06] and in about 15 minutes, get to a full minute-long video starring none other than your favorite podcast host, [00:13] Claire Vo. [00:14] Let's get to it. [00:16] This episode is brought to you by Merge. Building an AI product is one thing. The hard part is everything around it: connecting to the tools your team and customers rely on, letting agents take action with the right permissions, and keeping everything reliable and cost efficient [00:33] once you're in production. Most teams end up piecing that together themselves. So instead of building the product you actually care about, you get pulled into integrations, permissions, routing, and all the infrastructure underneath. Merge is the infrastructure layer for production AI. It connects to thousands of tools, gives agents secure ways to act inside them, and optimizes model routing and spend, [00:58] without you building or owning any of it. [01:00] OpenAI, Dropbox, and Ramp already use Merge to move fast and build AI right. Visit merge.dev slash howiai to start building for free. [01:12] This episode of How I AI is going to be an adventure because, I'm going to be honest, I'm not 100% sure this is going to work. I'm going to return to a product I covered very briefly a couple of weeks ago called Google Flow and the new Gemini Omni video generation model. And I'm going to try [01:29] really hard to create an AI avatar of myself that we can animate or, I guess, cinematically create using AI. So this is Google Flow, and one of the features of Google Flow and the Omni model is you are supposed to be able to create

1:45-3:16

[01:45] an avatar of yourself. Now, we tried this the day it came out. It did not work, but we're going to give it another college try and see if we can get a full-featured avatar of myself that then we can go and build consistent character videos off of. So I'm going to select up here. I'm going to create an avatar. We're going to click get started. [02:03] I'm gonna scan this QR code. I have my phone here. I've done this before, so hopefully it'll be fast. [02:10] Okay, I'm gonna put the mic away just for one second. [02:14] I'm going to allow access to my camera and we're just going to take some photos. [02:18] Okay. [02:19] Ready? Start. [02:21] 17? [02:23] 81. [02:24] 49? [02:26] 20? [02:27] 25? [02:28] 22? [02:30] Okay, now it's having me turn my head. [02:34] So I turned my head that way, gave me a check mark. [02:37] Turn my head the other way. [02:39] It's giving me [02:40] a check mark. [02:42] And it says we're done. Now, it said we were done last time we tried this. So we're going to see. It's going to take a couple minutes, and then we will come back [02:51] and see if I can actually use this avatar of myself. [02:55] Okay, so look at this beauty. There's this fisheye lens version [03:00] of me that is now an avatar. So I supposedly can use this, and let's use it to create a hype video for the How I AI podcast. [03:08] So I'm going to go in here and say, help me create [03:12] a storyboard for a hype video

3:16-4:47

[03:16] All right. [03:17] for the How I AI podcast. I already have a character [03:24] named Me we can reference. [03:27] Help me come up with a few scenes [03:31] that would make this great. [03:34] This is a podcast by Claire about the best ways to use AI at work [03:41] and in life! Exclamation mark. Okay. So what I love about Flow, or what has been pitched to me about Flow, is that it's not just a video generation tool. It's actually a whole creative suite. [03:52] And so ideally, it's going to be able to help me not only animate or video generate this avatar of myself. It's also going to help me actually brainstorm what this overall video could be. And I'm, you know, I'm creative, but I'm not video creative, so I'm excited to see what it looks like. [04:11] How do you imagine Claire? Is she in a modern studio or perhaps a bright, airy home office? [04:15] Should it feel high-tech and sleek or more grounded and lifestyle-focused? [04:19] And are we going for high energy and fast-paced, or thoughtful and striving? So I'm going to say, [04:23] she is in a dark [04:25] home office, dark green walls, [04:29] with books about AI and fun posters, lighting around, and [04:36] this should be [04:39] more authentic [04:42] lifestyle version, [04:45] but it's high tech and

4:48-6:28

[04:48] about coding. [04:51] have a hacker vibe to it. [04:54] Okay, a bunch of typos, but we'll see what this does. And what I love about these video models and these new tools, again, usually here on How I AI, we talk about [05:04] coding, we talk about website generation, we talk about PRDs and work product. But what I really appreciate about these new generative AI models, in particular these multimodal ones, image and video, is it unlocks for me [05:20] an ability to generate, create something that I would have never imagined [05:25] being able to do before. So I would have never been able to solo produce a hype video for my podcast. I would have a hard time brainstorming it. I wouldn't know how to frame it. I wouldn't know how to block it. But now I have this AI producer here that can help me with this effort. So [05:42] let's see what the frames are. It's about seven frames. [05:44] It's going to be an extreme close-up of me typing on a mechanical keyboard, totally on brand. Then there's going to be a wide shot of the office. Then it's going to reveal me in my ergonomic chair. Spoiler alert, I am not actually in an ergonomic chair. [06:00] I'm going to spin around. That's going to be funny. And it's going to give me a digital heads-up display, which is also ridiculous. But let's let it happen. [06:09] Then it's going to do a very, what I'm presuming to be a very cheesy AI montage, a lifestyle moment, a call to action. I'm going to hit you with the podcast microphone, and then it's going to say How I AI. If this looks good, I'm going to say...

6:29-8:01

[06:29] This is great. [06:30] Generate the storyboard. I already have the character [06:36] @Me. [06:37] And so I'm going to send that. We're going to see what it comes up with. I've noticed that it has a hard time referencing the Me character in some early tests. So let's see what it comes up with. I'm presuming it's going to take a couple of minutes. So we will take a mini break and then come back to see [06:54] what it looks like. [06:56] Okay, it looks like it's generating a grid for the storyboard. It can't use the avatar, so I think it's going to do it without the character reference. It'll be really interesting to see what it comes up with. But then as soon as it's ready, I'm going to go ahead and generate at least a couple of these storyboard scenes one by one, and we can see how well it does with my avatar. [07:18] Oh, I mean, this is delightful. Look at this glowy mechanical keyboard. Look at how [07:25] I am hacking on three keyboards. I'm going to make [07:28] little eyes at you with my fake glasses, my very trendy glasses. [07:33] There's going to be me dragging and dropping a file that probably says like AI.MD. I'm going to smile and I'm going to speak into the podcast. [07:41] This looks great. [07:44] So what I think I'm going to do is I'm going to paste in this first frame of the video that the agent came up with. And instead of saying Claire, I'm just going to [07:52] @-mention [07:54] this avatar that it gave me so that we can see if it generates this video with me

8:01-9:31

[08:01] as the character. [08:03] And so I think I've replaced my name here. [08:08] I've given details on camera, on lighting, on everything. I press enter. Let's see [08:14] what it creates with my avatar. I have no idea what we're going to get into, and hopefully it won't be terrifying. Okay, I'm already nervous. What is surprising to me, that I didn't actually expect, is it does have my posters and my books in the background here, I guess because they're behind me when I took the photo. It's taking advantage of that. And I'm going to share my audio as well. And we're going to see how this video worked. [08:40] Okay, I got that wrong. I actually generated images instead of videos. Totally messed up. Did not click the right thing down here in the bottom right. I had image generation instead of video generation. So again, I'm going to paste that [08:53] walkthrough of the scene here. I'm gonna replace my name [08:58] with the Me avatar. [09:00] It's going to have my fingers flying across that mechanical keyboard. It's going to be so cool. [09:06] I'm gonna go ahead and press [09:08] send, and we're gonna see how long it takes to generate a video now. Something you'll notice every time you generate videos (it used to work like this in Veo 2 and Veo 3 as well, [09:20] so I'm not surprised they do this) is they're generating two versions of it. It's going to take a couple of minutes. The image took [09:26] a couple seconds. These are probably going to take a couple minutes. So I will come back and hopefully we will have our first video

9:31-11:13

[09:31] with Claire's face in it. [09:33] And while we're waiting, I'm going to queue up one or two other scenes and see if we can get ones going with my actual face in it. Because some of these had like the back of my head as opposed to my face. And I think we want to see what my face avatar looks like. So we'll pick [09:51] frame 3 and see if we can get that going as well. [09:55] Okay, the first video generated. Now we have blue nail polish. I still like it. Okay. [10:00] Let's see. [10:03] We were told AI would replace us. [10:09] That is quite spooky. Okay, we were told AI is going to replace us. Let's see if the video with me actually generates a callback to that. So while that's generating, I'm going to go ahead and make all of these. We're going to stitch them together. It's going to be so awesome. So stick with us. [10:29] We're going to generate a bunch of videos and we're going to stitch it together into one long hype video. [10:35] This episode is brought to you by Jira Product Discovery. AI has made individual PMs incredibly productive, but multiplayer mode is where it still breaks: getting everyone aligned on what should actually get built. [10:49] Decisions live in a markdown file from last week. The roadmap's a spreadsheet no one's looking at. Jira Product Discovery is where teams actually decide what to build: [10:59] capture ideas, prioritize them as a team, and share a living roadmap everyone works from. It's powered by Atlassian's Teamwork Graph, so it can pull in customer feedback, what your team shipped, plus your goals, and suggest what to build next.

11:13-12:46

[11:13] And when a decision is made, you can hand it off straight to Jira, so a developer, or even an agent, can pick it up and start building. [11:21] Teams at Canva, Deliveroo, and Toast already use Jira Product Discovery. Join more than 25,000 teams at atlassian.com slash howiai. Start building the right things together. [11:37] Okay, I have seven scenes generating, but while we are waiting for those to finish, [11:43] I [11:43] just cannot... [11:46] Sorry. [11:48] Sorry for you all that are listening and not watching. I just got jump scared by the AI version of myself wearing glasses, turning around in a spinning chair. So let's take a look at both of these. This one's pretty good. I'm spinning in a circle. Okay, sorry. Back to those I need to describe this for. [12:10] So this is using an AI avatar of myself. The prompt was: I spin my ergonomic chair around to face the camera. I push my glasses, which I don't have, up to the bridge of my nose. And I say, this is Claire. I am Claire. And this is How I AI. Let's watch [12:25] V1 of this video, which is actually a scream riot. [12:33] I'm Claire, and this is How I AI. [12:37] Okay, it was actually pretty good. What's really funny is I do have the, it has the NVIDIA way in the background,

12:46-14:17

[12:46] which I don't have right here, but I do have upstairs. So I do believe the AI overlords are really [12:52] paying attention. I want to make you laugh and look at the second version where I spin in a circle twice. Pretty good. [13:01] I'm Claire, and this is How I AI. [13:06] This one got my not-curled hair a lot better, but I prefer the other video. It makes me look a little bit nicer. Okay, I'm going to take one minute. I'm going to stitch all these videos together [13:19] in the form factor that Gemini told me I should, that Flow told me I should. We're going to bring this hype video together. I'm going to show it to you end to end. And then I'm going to conclude today's very strange episode of How I AI, [13:32] where I use my avatar to create an end-to-end hype video for this podcast. [13:37] Cool. So it actually seems like I can show you a little bit of how we're going to stitch this video together. So if you see here, once I click into any one video, [13:44] I have a video editor timeline here that I can use right in the browser to stitch together all these videos. So I'm going to go ahead and add these in the order that [13:54] the original AI told me my hype video should go, and then we'll look at it end to end and we'll see if we really like it. [14:01] Okay, this took me about five minutes, but all I did was stitch together my favorite versions of all these avatar-generated AI videos, scene by scene, about seven of them together, to show one end-to-end hype video.

14:31-16:03

[14:31] And now, the worldwide debut of the How I AI hype video. [14:37] I am going to show you... [14:39] Who knows [14:41] what we're about to get, but we're about to get it. [14:43] Here we go. We were told AI would replace us. [14:52] Thank you. [14:55] Oh my God. [14:57] I'm Claire, and this is How I AI. [15:00] From automating the mundane to dreaming up the impossible. [15:08] It's about the tools that change the way we live and work. [15:14] Join me as we deconstruct the future, one prompt at a time. Subscribe to How I AI. [15:25] How I AI. Available now, everywhere you get your podcasts. [15:32] Okay. I... [15:36] I'm actually obsessed with this. Let's talk about what I love and what I don't. What I love: this took zero time and effort. And it is, I wouldn't say it's like 80% there, but is it 50% there? [15:50] 100% yes. Am I going to tweet this immediately? [15:54] Absolutely. Did this take no effort? [15:58] Basically no effort, no knowledge. Okay. So what did I like about this avatar experience?

16:03-17:41

[16:03] You know what? This is like kind of my face. It's not quite my face. I would say about 50% of the time it's my face and 50% of the time it's like an uncanny version of my face. Some things I noticed from a character consistency perspective: this gave me beautiful long wavy hair, which I have recently cut off because I have a child. So you see there's like a location inconsistency. This background [16:27] has books and [16:31] an hourglass. This background is a different color. It has plants. It pulls in some things from my avatar, like it pulls in this poster that was in the background when I took my photos. And it changes a little bit over time, and so you can see the books on the shelf change, the lighting changes. [16:51] As always, these video gen and image gen models are really early-2000s-coded on what they think AI and impressive technology is. So I'm holding like [17:02] a 24-inch iPad in this video, looking at a schematic of, it looks like a church. It's very confusing. The heads-up display that shows up [17:12] on my face when I'm looking at AI. I'm apparently coding in Gemini a robot of some sort. So it's pretty hilarious. But even looking at this frame, I would say this is the one that felt [17:26] like it looked most like my face. Like I'll just try to look serious so you all can see. [17:30] Thank you. [17:31] It's pretty good. It's even got my sun damage here. So good job, Gemini, not smoothing out my face. And so I do think this is

17:41-19:13

[17:41] 90% there. Not 100% there, but it's really interesting even seeing my face turn left and right, how accurate it got on the side profiles of my face. Now, this [17:55] scene right here where I'm laughing: [17:57] 100% uncanny valley. I look very strange, like I'm on some sort of medication, perhaps. And so I'm not sure it 100% has emotions really well. And some of the timing and hiccups you noticed while you were watching the video, I spoke over myself, [18:15] those sorts of things. But this scene right here [18:19] is legitimately pretty good. I bet with some consistent background prompting, with a little bit more effort, with some additional images going into this Omni model, I think I can make a hype video [18:32] that would convince most of you, if not all of you. Now, [18:38] do I think it's great at typography? Do I think it's great at graphics? No, this is kind of lame. This ending part is kind of lame. Yeah. [18:45] But again, we're talking, [18:47] it's probably 10 minutes [18:49] top to bottom. [18:51] So we're talking probably 15 minutes from very beginning, knew nothing about this tool, to I have this one-minute video now I can share with you all. [19:02] I'm pretty blown away, you guys. And so I'm going to go spend a little bit more time with the Google Omni model. I'm going to spend a little bit more time with Flow. This might be

19:13-20:33

[19:13] my new favorite hobby project. I'm kind of obsessed with it. I want to hear if you all are willing to put your avatar in here, if you can actually get it to generate consistent characters, and what your experience is using these kind of incredible new video models. So I know this is a little bit of a different style of How I AI. We usually do coding. We usually do work stuff. This is a tool I did not know. This is a process I'm very unfamiliar with. And I really think [19:43] I got an outcome that was much better than I expected with very little knowledge of the tool. So if that is not a How I AI success story, I'm not sure what is. [19:54] I hope you enjoyed this very strange mini episode of How I AI. I cannot wait to see what you generate. And please share your examples in the comments. Thanks for joining. [20:07] Thanks so much for watching. If you enjoyed the show, please like and subscribe here on YouTube or, even better, leave us a comment with your thoughts. [20:15] You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. [20:32] See you next time.
