Speaker A
What you're looking at right here is 36 of my most recent YouTube videos organized into an actual knowledge system that makes sense. And in today's video, I'm going to show you how you can set this up in 5 minutes. It's super, super easy. You can see here how we have these different nodes and different patterns emerging. And as we zoom in, we can see what each of these little dots represents. So, for example, this is one of my videos, $10,000 agentic workflows. We can see it's got some tags, it's got the video link, it's got the raw file, and it gives an explanation of what this video is about and what the takeaways are. And the coolest part is I can follow the backlinks to get where I want. There's a backlink for the WAT framework. There's a backlink for Claude code. There's a backlink for all these different tools I mentioned like Perplexity, Visual Studio Code, Nano Banana, and N and N. It also has techniques like the WAT framework or bypass permissions mode or human review checkpoint. So, as this continues to fill up, we can start to see patterns and relationships between every tool or every skill or every MCP server that I might have talked about in a YouTube video, and I can just query it in a really efficient way now that we have this actual system set up. And the crazy part is I said, "Hey, Claude code, go grab the transcripts from my recent videos and organize everything." I literally didn't have to do any manual relationship building here. It just figured it all out on its own. And then right here, I have a much smaller one, but this is more of my personal brain. So, this is stuff going on in my personal life. This is stuff going on with, you know, Up-to-AI or my YouTube channel or my different businesses and my employees and our Q2 initiatives and things like that. This is more of my own second brain. So, I've got one second brain here, and then I've got one basically YouTube knowledge system. And I could combine these or I could keep them separate, and I can just keep building more knowledge systems and plug them all into other AI agents that I need to have this context. It's just super cool. So, Andrej Karpathy just released this little post about LLM knowledge bases and explaining what he's been doing with them. And in just a matter of a few days, it got a ton of traction on X. So, let's do a quick breakdown, and then I'm going to show you guys how you can get this set up in basically 5 minutes. It's way more simple than you may think. Something I've been finding very useful recently is using LLMs to build personal knowledge bases for various topics of research interest. So, there's different stages. The first part is data ingest. He puts in basically source documents. So, he basically takes a PDF and puts it into Cloud Code, and then Cloud Code does the rest. He uses Obsidian as the IDE. So, this is nothing really too game-changing. Obsidian just lets you visually see your markdown files. But, for example, this Obsidian project right here with all this YouTube transcript stuff, that actually lives right here. This is the exact same thing. Here are the raw YouTube transcripts, and here's that wiki that I showed you guys with the different um folders for what Cloud Code did with my YouTube transcripts. And then there's a Q&A phase where you basically can ask questions about YouTube or about the research, and it can look through the entire wiki in a much more efficient way, and it can give you answers that are super intelligent. He said here, "I thought that I had to reach for fancy rag, but the LLM has been pretty good about auto maintaining index files and brief summaries of all documents, and it reads all the important related data fairly easily at this small scale." So, right now he's doing about 100 articles and about half a million words. So, there's a few other things that we'll cover later, but the TLDR is you give raw data to Cloud Code, it compares it, it organizes it, and then it puts it into the right spots with relationships, and then you can query it about anything. And it can help you identify where there's gaps in that node or in that, you know, relationship, and it can go do research and fill in the gaps. All right, so why is this a big deal? Because normal AI chats are ephemeral, meaning the knowledge disappears after the conversation. But, this method using Karpathy's LLM wiki makes knowledge compound like interest in a bank. People on X are calling it a game-changer because it finally makes AI feel like a tireless colleague who actually remembers everything and it stays organized. It's also super simple. It will take you 5 minutes to set up. I'll show you guys. You don't need a fancy vector database, embeddings, or complex infrastructure. It's literally just a folder with markdown files. That's it. You literally just have a vault up top. So, in this example, it's called my wiki. You've got a raw folder where you put all of the stuff, and then you've got a wiki folder, which is what the LLM takes from your raw and puts it into the wiki. So, in here you have all the wiki pages, which it will create, but then you also have an index, and you have a log. So, for example, in my YouTube transcripts vault, here is the index. You can see that I have all these different tools, which I could obviously click on and it would take me right to that page. Or after that, I have all the different techniques, agent teams, sub-agents, permission modes, the WAT framework, and then we've got different concepts, MCP servers, rag, vibe coding. We've got all these different sources, which are, you know, the YouTube videos. And then when I have people or when I have comparisons, they will be put in here in the index. And then we also have a log, which is the operation history. So, in this case, in the YouTube project, the log isn't huge, 'cause I only ran one huge batch of the initial 36 YouTube videos. But now every time I have one, I say, "Hey, can you go ahead and ingest the new YouTube video into the wiki?" And then we'll see every single time we update this. And then of course, you need your claw.md to explain how the project works and how to search through things and how to, you know, update things. It's also a big deal from a cost perspective, token efficiency and long-term value. One X user turned 383 scattered files and over 100 meeting transcripts into a compact wiki and dropped token usage by 95% when querying with Claude. And obviously, token management and efficiency is a huge conversation right now and will always be. The other thing that's really cool about this is there's not really like a GitHub repo you go copy or there's not a complicated setup. You literally just say, "Hey, Claude code, read this idea from Andre Karpathy and implement it." And people on X are now talking about like this is how 2026 AI agentic software and products will be made. You just give it a high-level idea and it goes and builds it out. And Karpathy even said, "Hey, you know, I left this prompt vague so that you guys can customize it." And I'll show you the ways in my two different vaults right now that it changed things a little bit based on the context and understanding of what the project is actually for. Okay, so this was the original tweet I just showed you guys. And then he followed up and said, "Hey, this one went viral, so here is the idea in a gist format." So, if you open this up, this is basically just another explanation of the core idea of how this works and why the architecture, indexing, all this kind of stuff. And by the way, this is the part where he says, "Hey, this is left vague so that you can hack it and customize it to your own project." So, we're going to come right back to this in a sec, but the first pre-wreck that we're going to do, it's not necessary, but I like to have a nice little front end to see the relationships, is we're going to go to Obsidian and download it. So, if you just go to obsidian.md, you can see this is the