## بسم الله الرحمن الرحيم


### Variational Autoencoders (VAEs)
*   [Auto-Encoding Variational Bayes (Kingma & Welling, ICLR 2014)](https://arxiv.org/abs/1312.6114)

### Generative Adversarial Networks (GANs)
*   [Generative Adversarial Networks (Goodfellow et al., NIPS 2014)](https://papers.nips.cc/paper/2014/hash/5ca3e9b122f61f8f06494c97b1afccf3-Abstract.html) (Note: Original NIPS link - please verify access)
*   [Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (DCGAN) (Radford et al., ICLR 2016)](https://arxiv.org/abs/1511.06434)

### Sequence-to-Sequence Models & Attention
*   [Sequence to Sequence Learning with Neural Networks (Sutskever et al., NIPS 2014)](https://arxiv.org/abs/1409.3215)
*   [Neural Machine Translation by Jointly Learning to Align and Translate (Bahdanau et al., ICLR 2015)](https://arxiv.org/abs/1409.0473)
*   [Attention Is All You Need (Transformer) (Vaswani et al., NIPS 2017)](https://arxiv.org/abs/1706.03762)

### Optimizers & Normalization
*   [Adam: A Method for Stochastic Optimization (Kingma & Ba, ICLR 2015)](https://arxiv.org/abs/1412.6980)
*   [Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (Ioffe & Szegedy, ICML 2015)](http://proceedings.mlr.press/v37/ioffe15.html)

### Computer Vision Architectures
*   [Going Deeper with Convolutions (GoogLeNet/Inception) (Szegedy et al., CVPR 2015)](https://www.cv-foundation.org/openaccess/content_cvpr_2015/html/Szegedy_Going_Deeper_With_2015_CVPR_paper.html)
*   [Very Deep Convolutional Networks for Large-Scale Image Recognition (VGGNet) (Simonyan & Zisserman, ICLR 2015)](https://arxiv.org/abs/1409.1556)
*   [Deep Residual Learning for Image Recognition (ResNet) (He et al., CVPR 2016)](https://arxiv.org/abs/1512.03385)
*   [You Only Look Once: Unified, Real-Time Object Detection (YOLO) (Redmon et al., CVPR 2016)](https://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Redmon_You_Only_Look_CVPR_2016_paper.html)

### Normalization & Regularization (Continued)
*   [Layer Normalization (Ba et al., NIPS 2016)](https://arxiv.org/abs/1607.06450)

### Key 2017-2020 Papers
*   [Attention Is All You Need (Vaswani et al., NIPS 2017)](https://arxiv.org/abs/1706.03762)
*   [U-Net: Convolutional Networks for Biomedical Image Segmentation (Ronneberger et al., MICCAI 2015)](https://arxiv.org/abs/1505.04597)
*   [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (Devlin et al., NAACL 2019)](https://arxiv.org/abs/1810.04805)
*   [An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (ViT) (Dosovitskiy et al., ICLR 2021)](https://arxiv.org/abs/2010.11929)

### Recent Foundational Models (2021-2023)
*   [Highly accurate protein structure prediction with AlphaFold (Jumper et al., Nature 2021)](https://www.nature.com/articles/s41586-021-03819-2)
*   [CLIP: Learning Transferable Visual Models From Natural Language Supervision (Radford et al., ICML 2021)](https://arxiv.org/abs/2103.00020)
*   [Denoising Diffusion Probabilistic Models (DDPM) (Ho et al., NeurIPS 2020)](https://arxiv.org/abs/2006.11239)
*   [High-Resolution Image Synthesis with Latent Diffusion Models (Stable Diffusion / LDM) (Rombach et al., CVPR 2022)](https://arxiv.org/abs/2112.10752)
*   [Learning Transferable Visual Models From Natural Language Supervision (CLIP) (Radford et al., ICML 2021)](https://arxiv.org/abs/2103.00020)

### Large Language Models (LLMs)
*   [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (Devlin et al., NAACL 2019)](https://arxiv.org/abs/1810.04805)
*   [Language Models are Few-Shot Learners (GPT-3) (Brown et al., NeurIPS 2020)](https://arxiv.org/abs/2005.14165)
*   [Training language models to follow instructions with human feedback (InstructGPT) (Ouyang et al., NeurIPS 2022)](https://arxiv.org/abs/2203.02155)
*   [GPT-4 Technical Report (OpenAI, 2023)](https://arxiv.org/abs/2303.08774)
*   [Llama 2: Open Foundation and Fine-Tuned Chat Models (Touvron et al., 2023)](https://arxiv.org/abs/2307.09288)
*   [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (Devlin et al., NAACL 2019)](https://arxiv.org/abs/1810.04805) - Duplicate removed, keep one.
*   [Language Models are Few-Shot Learners (GPT-3) (Brown et al., NeurIPS 2020)](https://arxiv.org/abs/2005.14165)
*   [Training language models to follow instructions with human feedback (InstructGPT) (Ouyang et al., NeurIPS 2022)](https://arxiv.org/abs/2203.02155)

### Transformers Beyond Text
*   [An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (ViT) (Dosovitskiy et al., ICLR 2021)](https://arxiv.org/abs/2010.11929)
*   [High-Resolution Image Synthesis with Latent Diffusion Models (Stable Diffusion / LDM) (Rombach et al., CVPR 2022)](https://arxiv.org/abs/2112.10752) - Duplicate removed, keep under Generative Models.
*   [Segment Anything (Kirillov et al., ICCV 2023)](https://arxiv.org/abs/2304.02643)
*   [Generative Agents: Interactive Simulacra of Human Behavior (Park et al., 2023)](https://arxiv.org/abs/2304.03442)

### Miscellaneous & Other Notable Papers
*   [Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (Ioffe & Szegedy, ICML 2015)](http://proceedings.mlr.press/v37/ioffe15.html) - Duplicate removed, keep under Optimizers & Normalization.
*   [Deep Residual Learning for Image Recognition (ResNet) (He et al., CVPR 2016)](https://arxiv.org/abs/1512.03385)
*   [U-Net: Convolutional Networks for Biomedical Image Segmentation (Ronneberger et al., MICCAI 2015)](https://arxiv.org/abs/1505.04597)
*   [Highly accurate protein structure prediction with AlphaFold (Jumper et al., Nature 2021)](https://www.nature.com/articles/s41586-021-03819-2)
*   [Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (Wei et al., NeurIPS 2022)](https://arxiv.org/abs/2201.11903)
*   [Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Yao et al., NeurIPS 2023)](https://arxiv.org/abs/2305.10601)
*   [Visual Instruction Tuning (LLaVA) (Liu et al., NeurIPS 2023)](https://arxiv.org/abs/2304.08485)