maverickg59/awesome_ai_resources

AI Resources

AI for Leaders

Learn About AI

AI Engineer Handbooks:

Research Blogs:

Engineer Blogs:

Deep Learning:

Machine Learning:

Natural Language Processing (NLP):

Convolutional Neural Networks:

Large Language Models:

Stanford YouTube Videos

Stanford Online's Playlists (look for AI collections):

Stanford CS229 Links

Stanford CS25 Links

Applied AI

Agentic Editors:

Chat Apps

Agentic Modeling:

Prompt Engineering:

Model Context Protocol:

Running Locally - Model Engine

Open Source Models

One Bit LLMs

Comparing Models:

Tools:

Research Papers:

Ilya Sutskever's Top 30 (Purported | PDF Where Possible)

| Year | Paper Title | Publication Details | Link |
| --- | --- | --- | --- |
| 1993 | Keeping Neural Networks Simple by Minimizing the Description Length of the Weights | NIPS 1993 | PDF |
| 2004 | A Tutorial Introduction to the Minimum Description Length Principle | Online Publication | arXiv |
| 2008 | Machine Super Intelligence | PhD Thesis | Google Drive |
| 2011 | The First Law of Complexodynamics | Blog Post | scottaaronson.blog |
| 2012 | ImageNet Classification with Deep Convolutional Neural Networks | NIPS | PDF |
| 2014 | Quantifying the Rise and Fall of Complexity in Closed Systems: The Coffee Automaton | arXiv | arXiv |
| 2014 | Neural Turing Machines | arXiv | arXiv |
| 2015 | Recurrent Neural Network Regularization | arXiv | arXiv |
| 2015 | The Unreasonable Effectiveness of Recurrent Neural Networks | Blog Post | karpathy.github.io |
| 2015 | Pointer Networks | arXiv | PDF |
| 2015 | Understanding LSTM Networks | Blog Post | PDF |
| 2015 | Deep Speech 2: End-to-End Speech Recognition in English and Mandarin | PMLR | PDF |
| 2015 | Deep Residual Learning for Image Recognition | arXiv | arXiv |
| 2016 | Order Matters: Sequence to Sequence for Sets | arXiv | arXiv |
| 2016 | Multi-Scale Context Aggregation by Dilated Convolutions | arXiv | arXiv |
| 2016 | Neural Machine Translation by Jointly Learning to Align and Translate | arXiv | arXiv |
| 2016 | Identity Mappings in Deep Residual Networks | arXiv | arXiv |
| 2017 | Variational Lossy Autoencoder | arXiv | arXiv |
| 2017 | Kolmogorov Complexity and Algorithmic Randomness | Henry Steinitz | PDF |
| 2017 | Neural Message Passing for Quantum Chemistry | arXiv | arXiv |
| 2017 | A Simple Neural Network Module for Relational Reasoning | arXiv | arXiv |
| 2017 | Attention Is All You Need | arXiv | arXiv |
| 2018 | The Annotated Transformer | Workshop Paper | nlp.seas.harvard.edu |
| 2018 | Relational Recurrent Neural Networks | arXiv | arXiv |
| 2018 | GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism | arXiv | arXiv |
| 2020 | Scaling Laws for Neural Language Models | arXiv | arXiv |
| 2020 | Dense Passage Retrieval for Open-Domain Question Answering | arXiv | arXiv |
| 2020 | Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | arXiv | arXiv |
| 2023 | Lost in the Middle: How Language Models Use Long Contexts | arXiv | arXiv |
| 2023 | The Perils & Promises of Fact-checking with Large Language Models | arXiv | arXiv |
| 2023 | Zephyr: Direct Distillation of LM Alignment | arXiv | arXiv |
| 2023 | Better & Faster Large Language Models Via Multi-token Prediction | arXiv | arXiv |

I created a NotebookLM audio overview of these papers. Keep in mind that it is AI-generated and not a substitute for reading and understanding the papers themselves:

The Illusion of Thinking

| Year | Paper Title | Publication Details | Link |
| --- | --- | --- | --- |
| 2025 | The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity (Apple) | arXiv | arXiv |
| 2025 | The Illusion of the Illusion of Thinking (Anthropic) | arXiv | arXiv |

Recent Papers

| Year | Paper Title | Link | GitHub (if found) |
| --- | --- | --- | --- |
| 2020 | T5 | PDF | GitHub |
| 2020 | GPT‑3 | PDF | N/A |
| 2020 | RAG | PDF | GitHub |
| 2022 | Chain-of-Thought Prompting | PDF | N/A |
| 2022 | Constitutional AI | PDF | GitHub |
| 2023 | Gorilla: Large Language Model Connected with Massive APIs | arXiv | N/A |
| 2023 | GPT‑4 Technical Report | PDF | N/A |
| 2023 | Llama 2 | PDF | GitHub |
| 2023 | Instruction Tuning Survey | PDF | GitHub |
| 2023 | Direct Preference Optimization (DPO) | PDF | GitHub |
| 2024 | The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits | arXiv | N/A |
| 2024 | Mixtral of Experts | PDF | N/A |
| 2024 | Learning to Retrieve In‑Context Examples | PDF | GitHub |
| 2024 | xLSTM | PDF | GitHub |
| 2024 | Visual Autoregressive Modeling | PDF | N/A |
| 2024 | Learning Interactive Real‑World Simulators | PDF | GitHub |
| 2024 | Debating with More Persuasive LLMs | PDF | GitHub |
| 2025 | M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models | arXiv | N/A |
| 2025 | PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers | arXiv | N/A |
| 2025 | Self-Adapting Language Models | arXiv | N/A |
| 2025 | RAG+: Enhancing Retrieval-Augmented Generation with Application-Aware Reasoning | arXiv | N/A |
| 2025 | Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics | arXiv | N/A |
| 2025 | Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities | Google | N/A |
| 2025 | Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives | arXiv | N/A |
| 2025 | Play to Generalize: Learning to Reason Through Game Play | arXiv | N/A |
| 2025 | Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought | arXiv | N/A |
| 2025 | Reinforcement Pre-Training | arXiv | N/A |
| 2025 | Build the web for agents, not agents for the web | arXiv | N/A |
| 2025 | Large Language Models and Emergence: A Complex Systems Perspective | arXiv | N/A |
| 2026 | Large Language Model Reasoning Failures | arXiv | GitHub |
| 2026 | FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation | arXiv | GitHub |

Older Papers

| Year | Paper Title | PDF Link | GitHub Link |
| --- | --- | --- | --- |
| 2013 | Efficient Estimation of Word Representations in Vector Space | PDF | GitHub |
| 2014 | Generative Adversarial Networks (GANs) | PDF | GitHub |
| 2015 | ImageNet Large Scale Visual Recognition Challenge | PDF | N/A |
| 2018 | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | PDF | GitHub |
| 2019 | RoBERTa: A Robustly Optimized BERT Pretraining Approach | PDF | N/A |

Potentially Controversial

| Year | Paper Title | Link |
| --- | --- | --- |
| 2025 | Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce | arXiv |
| 2025 | Don't Pay Attention | arXiv |

Wiki Articles on Important Concepts:

About

A curated list of resources tailored towards AI Engineers
