- https://cdn.openai.com/business-guides-and-resources/ai-in-the-enterprise.pdf
- https://cdn.openai.com/business-guides-and-resources/identifying-and-scaling-ai-use-cases.pdf
- https://www.oneusefulthing.org/p/making-ai-work-leadership-lab-and
- Andrej Karpathy Home
- Andrej Karpathy Blog
- Dr. Furu Wei Home
- Dr. Furu Wei AGI
- Hazy Research - Stanford
- https://vinija.ai/
- https://simonwillison.net/
- https://lilianweng.github.io/
- https://mcginniscommawill.com/
- https://mlu-explain.github.io/
- https://distill.pub/
- https://themlbook.com/
- https://vinija.ai/concepts/index.html
- https://bbycroft.net/llm
- https://thelmbook.com/
- Andrej Karpathy Deep Dive into LLMs
- Guide to Reasoning Models
Stanford Online's Playlists (look for AI collections):
- Stanford CS229 Links
- Stanford CS25 Links
- Claude Code
- Mastering Claude Code in 30 minutes (Anthropic)
- Amp Code
- OpenCode
- VS Code with GitHub Copilot
- Windsurf
- Cursor
- [RooCode](https://roocode.com/)
- Cline
- https://github.com/humanlayer/12-factor-agents
- https://github.com/microsoft/ai-agents-for-beginners
- https://github.com/karpathy/LLM101n
- https://github.com/anthropics/anthropic-cookbook/tree/main/patterns/agents
- https://ampcode.com/how-to-build-an-agent
- https://www.anthropic.com/engineering/building-effective-agents
- https://cdn.openai.com/business-guides-and-resources/a-practical-guide-to-building-agents.pdf
- https://github.com/NirDiamant/agents-towards-production
- Hugging Face Agents Course
- Tiny Agents - Python
- Tiny Agents - JS
- https://services.google.com/fh/files/misc/gemini-for-google-workspace-prompting-guide-101.pdf
- https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/overview
- https://github.com/punkpeye/awesome-mcp-servers
- https://github.com/punkpeye/awesome-mcp-clients
- https://github.com/punkpeye/awesome-mcp-devtools
- https://engineering.block.xyz/blog/blocks-playbook-for-designing-mcp-servers
- https://modelcontextprotocol.io/introduction
- https://github.com/open-webui/mcpo
- https://github.com/ollama/ollama
- https://github.com/vllm-project/vllm
- Docker Model Runner + Go
- https://github.com/lmstudio-ai
- https://github.com/nomic-ai/gpt4all
- https://docsbot.ai/models
- https://artificialanalysis.ai/
- https://web.lmarena.ai/leaderboard
- https://models.dev
- https://livecodebench.github.io/index.html
- Berkeley Function Calling Leaderboard
| Year | Paper Title | Publication Details | Link |
|---|---|---|---|
| 1993 | Keeping Neural Networks Simple by Minimizing the Description Length of the Weights | COLT 1993 | |
| 2004 | A Tutorial Introduction to the Minimum Description Length Principle | Online Publication | arXiv |
| 2008 | Machine Super Intelligence | PhD Thesis | Google Drive |
| 2011 | The First Law of Complexodynamics | Blog Post | scottaaronson.blog |
| 2012 | ImageNet Classification with Deep Convolutional Neural Networks | NIPS | |
| 2014 | Quantifying the Rise and Fall of Complexity in Closed Systems: The Coffee Automaton | arXiv | arXiv |
| 2014 | Neural Turing Machines | arXiv | arXiv |
| 2015 | Recurrent Neural Network Regularization | arXiv | arXiv |
| 2015 | The Unreasonable Effectiveness of Recurrent Neural Networks | Blog Post | karpathy.github.io |
| 2015 | Pointer Networks | arXiv | |
| 2015 | Understanding LSTM Networks | Blog Post | |
| 2015 | Deep Speech 2: End-to-End Speech Recognition in English and Mandarin | PMLR | |
| 2015 | Deep Residual Learning for Image Recognition | arXiv | arXiv |
| 2016 | Order Matters: Sequence to Sequence for Sets | arXiv | arXiv |
| 2016 | Multi-Scale Context Aggregation by Dilated Convolutions | arXiv | arXiv |
| 2016 | Neural Machine Translation by Jointly Learning to Align and Translate | arXiv | arXiv |
| 2016 | Identity Mappings in Deep Residual Networks | arXiv | arXiv |
| 2017 | Variational Lossy Autoencoder | arXiv | arXiv |
| 2017 | Kolmogorov Complexity and Algorithmic Randomness | Henry Steinitz | |
| 2017 | Neural Message Passing for Quantum Chemistry | arXiv | arXiv |
| 2017 | A Simple Neural Network Module for Relational Reasoning | arXiv | arXiv |
| 2017 | Attention Is All You Need | arXiv | arXiv |
| 2018 | The Annotated Transformer | Workshop Paper | nlp.seas.harvard.edu |
| 2018 | Relational Recurrent Neural Networks | arXiv | arXiv |
| 2018 | GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism | arXiv | arXiv |
| 2020 | Scaling Laws for Neural Language Models | arXiv | arXiv |
| 2020 | Dense Passage Retrieval for Open-Domain Question Answering | arXiv | arXiv |
| 2020 | Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | arXiv | arXiv |
| 2023 | Lost in the Middle: How Language Models Use Long Contexts | arXiv | arXiv |
| 2023 | The Perils & Promises of Fact-checking with Large Language Models | arXiv | arXiv |
| 2023 | Zephyr: Direct Distillation of LM Alignment | arXiv | arXiv |
| 2024 | Better & Faster Large Language Models via Multi-token Prediction | arXiv | arXiv |
I created a NotebookLM audio overview of these papers. Keep in mind that it's AI-generated and not a full substitute for reading and understanding the studies:
| Year | Paper Title | Publication Details | Link |
|---|---|---|---|
| 2025 | The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity (Apple) | arXiv | arXiv |
| 2025 | The Illusion of the Illusion of Thinking (Anthropic) | arXiv | arXiv |
| Year | Paper Title | Link | GitHub (if found) |
|---|---|---|---|
| 2020 | T5 | GitHub | |
| 2020 | GPT‑3 | N/A | |
| 2020 | RAG | GitHub | |
| 2022 | Chain-of-Thought Prompting | N/A | |
| 2022 | Constitutional AI | GitHub | |
| 2023 | Gorilla: Large Language Model Connected with Massive APIs | arXiv | N/A |
| 2023 | GPT‑4 Technical Report | N/A | |
| 2023 | Llama 2 | GitHub | |
| 2023 | Instruction Tuning Survey | GitHub | |
| 2023 | Direct Preference Optimization (DPO) | GitHub | |
| 2024 | The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits | arXiv | N/A |
| 2024 | Mixtral of Experts | N/A | |
| 2024 | Learning to Retrieve In‑Context Examples | GitHub | |
| 2024 | xLSTM | GitHub | |
| 2024 | Visual Autoregressive Modeling | N/A | |
| 2024 | Learning Interactive Real‑World Simulators | GitHub | |
| 2024 | Debating with More Persuasive LLMs | GitHub | |
| 2025 | M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models | arXiv | N/A |
| 2025 | PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers | arXiv | N/A |
| 2025 | Self-Adapting Language Models | arXiv | N/A |
| 2025 | RAG+: Enhancing Retrieval-Augmented Generation with Application-Aware Reasoning | arXiv | N/A |
| 2025 | Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics | arXiv | N/A |
| 2025 | Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities | N/A | |
| 2025 | Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives | arXiv | N/A |
| 2025 | Play to Generalize: Learning to Reason Through Game Play | arXiv | N/A |
| 2025 | Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought | arXiv | N/A |
| 2025 | Reinforcement Pre-Training | arXiv | N/A |
| 2025 | Build the web for agents, not agents for the web | arXiv | N/A |
| 2025 | Large Language Models and Emergence: A Complex Systems Perspective | arXiv | N/A |
| 2026 | Large Language Model Reasoning Failures | arXiv | GitHub |
| 2026 | FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation | arXiv | GitHub |
| Year | Paper Title | PDF Link | GitHub Link |
|---|---|---|---|
| 2013 | Efficient Estimation of Word Representations in Vector Space | GitHub | |
| 2014 | Generative Adversarial Networks (GANs) | GitHub | |
| 2015 | ImageNet Large Scale Visual Recognition Challenge | N/A | |
| 2018 | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | GitHub | |
| 2019 | RoBERTa: A Robustly Optimized BERT Pretraining Approach | N/A | |
| Year | Paper Title | Link |
|---|---|---|
| 2025 | Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce | arXiv |
| 2025 | Don't Pay Attention | arXiv |