From Neural Networks to Transformers

From Neural Networks to Transformers: Understanding the Foundation of Modern AI

Every AI tool people use today — from ChatGPT to Claude and Gemini — is built on a series of breakthroughs that unfolded over several decades of research. It’s tempting to jump straight into Large Language Models without understanding the building blocks underneath them, but doing so often leaves a gap in intuition that makes everything else harder to follow. Once you understand Neural Networks, Transfer Learning, Tokenization, Embeddings, Attention, and Transformers, modern AI becomes significantly easier to reason about — not as magic, but as a series of well-understood engineering ideas stacked on top of each other. ...

April 26, 2026 · 5 min · Ashish Mishra
Understanding Large Language Models

Understanding Large Language Models: How ChatGPT Actually Works

Large Language Models are among the most important breakthroughs in modern technology. They power the chat assistants, coding tools, and search experiences that millions of people use every day. Yet most people who use ChatGPT or similar tools have never seen what happens between typing a question and getting an answer. This post walks through the core ideas behind LLMs in plain language — what they actually are, how they decide what to say next, and why they sometimes get things wrong. ...

February 5, 2026 · 4 min · Ashish Mishra