4 entries with this tag
The paper that bridged pretraining and ChatGPT. Instruction tuning showed how a simple format, describing tasks in natural language, could make models dramatically better at understanding and following what you ask them to do.
A beginner-friendly explanation of GPT-2 (2019), the paper that showed AI could write coherent, creative text by simply predicting the next word. Part 3 of our AI Papers Explained series.
A beginner-friendly explanation of BERT (Bidirectional Encoder Representations from Transformers), the 2018 paper that taught AI to understand language by reading in both directions. Follow-up to our 'Attention Is All You Need' explainer.
A beginner-friendly explanation of the groundbreaking 'Attention Is All You Need' paper that introduced Transformers. Learn what attention mechanisms are, why they matter, and how they power modern AI like ChatGPT.