Resources

This document contains all the external resources, links, and references mentioned in the “Exploring Generative AI Models (Part 1)” lecture.

Foundational Papers

Google Colab Notebooks

All demo notebooks from the presentation:

Development Platforms & Tools

Google Colab

Hugging Face

API Providers & Model Access

OpenAI

Anthropic

OpenRouter

  • Website: https://openrouter.ai
  • Unified API to hundreds of AI models through a single endpoint
  • Compatible with OpenAI’s Chat Completions API format
  • Access to OpenAI, Claude, Gemini, Grok, Nova, Llama, DeepSeek, Qwen, and more

Local Model Hosting Tools

LM Studio

  • Website: https://lmstudio.ai
  • Desktop application for running LLMs locally
  • Supports GGUF quantized models
  • Built on llama.cpp

Ollama

  • Website: https://ollama.ai
  • Command-line tool for running LLMs locally
  • Simple model management and deployment

llama.cpp

Major Language Models

GPT Series (OpenAI)

  • GPT-1 (June 2018): 117M parameters
  • GPT-2 (Feb 2019): 1.5B parameters
  • GPT-3 (May 2020): 175B parameters
  • GPT-3.5 / ChatGPT (Nov 2022): RLHF-tuned
  • GPT-4 series (2023+): Multimodal capabilities

Llama Series (Meta)

  • Llama 1 (Feb 2023): 7B-65B parameters, researcher access only
  • Llama 2 (Jul 2023): First open-weights commercial license
  • Llama 3 series (2024+): Improved performance and scale

Gemma Series (Google)

Other Notable Models

  • Mistral: Open-weight models from Mistral AI
  • OLMo: Fully open-source model from AI2 (Allen Institute for AI)
  • DeepSeek, Qwen: Chinese open-weight models

Key Concepts & Techniques

RLHF (Reinforcement Learning from Human Feedback)

  • Technique for fine-tuning models to follow instructions
  • Used in InstructGPT, ChatGPT, and Claude
  • Human raters rank model responses to train a reward model

Quantization

  • GGUF Format: GPT-Generated Unified Format
    • Single-file architecture supporting 2-bit to 8-bit quantization
    • Developed by llama.cpp community
  • MLX Format: Apple’s ML framework for Apple Silicon
    • Supports 4-bit and 8-bit quantization
    • Released late 2023

Additional Learning Resources

API Documentation

Communities

Model Parameter Comparison

Text Models (Approximate Sizes)

  • 1B parameters: ~2GB
  • 4B parameters: ~8.6GB
  • 12B parameters: ~23GB
  • 70B parameters: ~140GB (full precision)
  • 175B parameters (GPT-3): ~350GB (full precision)

Note: Quantization can reduce these sizes by 50-75% with minimal quality loss