# Zezhou Huang - Crack Any Codebase with AI (Highlights) ![rw-book-cover|256](https://readwise-assets.s3.amazonaws.com/media/uploaded_book_covers/profile_155788/7c79c94f-a746-4473-b88c-2c9d3ed489a1.jpg) ## Metadata **Review**:: [readwise.io](https://readwise.io/bookreview/61569142) **Source**:: #from/readwise #from/manning **Zettel**:: #zettel/fleeting **Status**:: #x **Authors**:: [[Zezhou Huang]] **Full Title**:: Crack Any Codebase with AI **Category**:: #books #readwise/books **Category Icon**:: 📚 **Document Tags**:: #from/manning **Highlighted**:: [[2026-06-23]] **Created**:: [[2026-07-04]] ## Highlights ### 1 The shift: Code is free now. Understanding isn't. - Comprehension debt is different. It’s code that works but nobody on the team fully gets. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-1/v-1/4a710486f3a1511ea5b5b727f5fc0bfb)) ^1027978110 - No more reps means no more growth. Understanding is no longer a passive byproduct of your daily work; it is a deliberate practice you have to choose. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-1/v-1/2834fca2b9c66204e20625751b95f503)) ^1027978111 - The Engineer treats every answer as a hypothesis. Each push-back builds your architectural muscle. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-1/v-1/f0aeb7f97e086e2be4eed86a4395b947)) ^1027978112 - An Engineer issues a specific command that forces the AI to produce something concrete, something you can check against reality. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-1/v-1/6d9355ff67ee1d1d50f6e8061e8356d0)) ^1027978113 ### 2 Chat: Paste code, get answers - Replace "keep it simple" with "at most 10 nodes, 2-4 words each." Replace "be concise" with "max 3 sentences per point." ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-2/v-1/66b3311065af6f5138e94a6f3b02afa5)) ^1027978115 - Copy-pasting AI output feels productive. It isn't. The act of rewording forces you to confront what you actually understand versus what you're just recognizing. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-2/v-1/1c71fbf4a840a3436b5b55a6ad3bc165)) ^1027978116 ### 3 Workflow: Build your own Codebase Knowledge Builder - Start with entry points. Look for the files that bootstrap the application: main.py, server.ts, app.py, and CLI commands. These files import everything important, so their import list is the architecture. Django's core/handlers/base.py is just 375 lines, but it loads middleware, resolves the URL, calls the view, and returns the response. Read that one file, and you know which ten other files matter. VS Code's workbench.common.main.ts imports every major subsystem (editor, terminal, debug, extensions), so you can see the full architecture from a single import list. Always pick a concrete implementation, not just the base class. If you only pick abstract base classes, you'll understand the shape of the code but not what it actually does. Think of it like reading a restaurant's menu template (font, layout, sections) without seeing any actual dishes. HuggingFace Transformers defines a base class called PreTrainedModel that all 440 AI models inherit from, and it handles loading, saving, and running models. But reading just that base class, you have no idea what a real model looks like. Pick one concrete model (say, BERT at 1,395 lines), and you see how the base class gets used in practice. We diffed BERT against another model called RoBERTa (1,272 lines): ~1,100 lines were copy-paste identical. The base class plus one example teaches you the pattern, and the other 439 are variations on the same theme. If you see N similar directories, pick one. Large repos don't have 500 unique files; they have 5 patterns repeated 100 times each. LangChain has 15 integration packages (OpenAI, Anthropic, Chroma, etc.) that all follow the same structure: same Makefile, same directory layout, same test patterns. An open-source chat platform we analyzed had 77 messaging extensions (Discord, Slack, Telegram, WhatsApp, Signal...) with nearly identical code in each. Reading one extension teaches you what all 77 do, so pick the most representative one and skip the rest. Skip plumbing, even when it looks important. Files like crypto.py, telemetryService.ts, and retry_handler.py have meaningful names and contain real code, but they teach you nothing about the architecture because they're utility code that could exist in any project. Django's .pre-commit-config.yaml tells you what linter they use, not how requests flow. VS Code's telemetryService.ts tells you how they track usage metrics, not how the editor works. LangChain has 21 lock files totaling 8.3 MB of pure dependency metadata. Every slot you burn on plumbing is a slot you can't spend on the code that actually makes the project unique. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-3/v-1/52468a8e4a1206686c7e01d4685b5732)) ^1027978118 - They don't read linearly or read everything; they identify the core first, then go deep on each piece. That's exactly the pattern we're about to build, because the context window is finite and attention is a zero-sum game: tokens spent on breadth are tokens not spent on depth. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-3/v-1/6e4a19b9750d7e0d90b530505d4bac64)) ^1027978119 - The fix is a pattern you'll use throughout this book: overview first, then zoom into each piece separately. ([View Highlight](https://livebook.manning.com/book/crack-any-codebase-with-ai/chapter-3/v-1/9844491630b49cc53bedb20d5730d454)) ^1027978120