Living Timeline

This page is a living timeline that brings together my long-form writing, project launches, and a few personal milestones in one chronological stream.

By listing articles and project releases side by side, you can trace how ideas move from early notes into published posts or shipped software over time. Each entry is tagged with the domains involved β€” from LLM systems and reinforcement learning to the occasional life event that influenced the work.

Everything is ordered from newest to oldest, with upcoming experiments included to highlight what's currently in the pipeline.

Future plans are clearly marked so you can see what might be coming next.

Activity Feed

2025

  1. Article No link Upcoming

    Reinforcement learning roundup (probably)

    Collecting field research for a late-2025 write-up once I know what sticks.

  2. Project No link Upcoming

    RL sandbox prototypes

    Experimenting with a grab bag of RL tools to match the article whenever the real scope reveals itself.

  3. Article
    Contextual Bandit Theory: Regret Bounds and Exploration (opens project or article)

    Understand the theory behind contextual bandits: regret bounds, the exploration-exploitation tradeoff, reward models, and why certain algorithms work. Math that directly informs practice.

    • contextual-bandits
    • machine-learning-theory
    • regret-bounds
    • exploration-exploitation
  4. Article
    When to Use Contextual Bandits: The Decision Framework (opens project or article)

    Stop running month-long A/B tests that leave value on the table. Learn when contextual bandits are the right choice for adaptive, personalized optimizationβ€”and when to stick with simpler alternatives.

    • contextual-bandits
    • reinforcement-learning
    • online-learning
    • personalization
  5. Article
    Beyond the Vibe Check: A Systematic Approach to LLM Evaluation (opens project or article)

    Stop relying on gut feelings to evaluate LLM outputs. Learn systematic approaches to build trustworthy evaluation pipelines with measurable metrics, proven methods, and production-ready practices. A practical guide covering faithfulness vs helpfulness, LLM-as-judge techniques, bias mitigation, and continuous monitoring.

    • llm-evaluation
    • machine-learning
    • rag-systems
    • ai-quality
    • systematic-testing
    • production-ml

2024

  1. Bit
    Attention Mechanism (opens project or article)

    Deep Learning flashcard

    • Deep Learning
  2. Bit
  3. Bit
  4. Bit
  5. Bit
  6. Bit
  7. Bit
  8. Bit
  9. Life No link

    Became a father

    Our first child, Benjamim, arrived on September 12, 2024 β€” and everything changed in the best possible way. I hit pause on the deep-learning roadmap to start collecting training data from the tiniest (and most fascinating) human dataset I’ll ever work with. These days, midnight diaper shifts feel a lot like reinforcement-learning loops β€” except the reward signal is a sleepy giggle that makes every iteration worth it. I still catch myself jotting notes in our β€œfamily lab notebook,” half scientist, half dad, completely in awe.

2023

  1. Project
    Large Language Models with MLX (opens project or article)

    I explored chat tooling on Apple Silicon using MLX to understand the runtime and packaging story.

    • llms
    • mistral
    • lamma2
  2. Project
    LoRA and DoRA Implementation (opens project or article)

    I implemented LoRA and DoRA from scratch in PyTorch to understand the methods end to end.

    • llms
    • peft
    • pytorch
  3. Article
    OpenELM Notes (opens project or article)

    I wrote about OpenELM and how Apple approaches efficient language models.

    • OpenELM
    • research
    • paper
  4. Project
    RAG with LlamaIndex, Elasticsearch, Llama3 (opens project or article)

    I built a retrieval pipeline with LlamaIndex, Elasticsearch, and Llama3 to test local deployment workflows.

    • Elasticsearch
    • LlamaIndex
    • Llama3