Hi, I'm Aayan 👋
Artifical Intelligence/Machine Learning Developer
AM

About

Hey there! I'm Aayan and I work on building AI projects, from finetuning models like OdysseyXL to developing systems such as Maverick Search. Whether its deep learning, experimenting with model optimization, or creating something entirely new, I'm always exploring something. When I'm not coding or training models, you'll probably find me watching Formula 1.

Education/Certificates

H

Harvard University - CS50x: Introduction to Computer Science

2025 - Present
This is CS50x , Harvard University's introduction to the intellectual enterprises of computer science and the art of programming for majors and non-majors alike, with or without prior programming experience. An entry-level course taught by David J. Malan, CS50x teaches students how to think algorithmically and solve problems efficiently. Topics include abstraction, algorithms, data structures, encapsulation, resource management, security, software engineering, and web development. Languages include C, Python, SQL, and JavaScript plus CSS and HTML. Problem sets inspired by real-world domains of biology, cryptography, finance, forensics, and gaming.
M

Massachusetts Institute of Technology - 6.S191 Introduction to Deep Learning

2025 - Present
MIT's introductory program on deep learning methods with applications to natural language processing, computer vision, biology, and more! Students will gain foundational knowledge of deep learning algorithms, practical experience in building neural networks, and understanding of cutting-edge topics including large language models and generative AI.
C

CodeSignal - Building Neural Networks with PyTorch

2024 - 2025
Master PyTorch with this learning path, designed for those experienced in Python and machine learning. From tensor basics to advanced modeling, it includes practical exercises focused on real-world datasets, such as the wine dataset, enhancing your deep learning skills through PyTorch.
S

Sololearn - Python Developer

2024 - 2025
Python is the world's fastest growing programming language is easy to read, learn and code. You'll learn to build interactive programs and automate your tasks, analyze and visualize even the most complex data and create AI and machine learning models. No previous coding experience needed.

Skills

Python
Vercel
NumPy
PyTorch
TensorFlow
scikit-learn
Docker
Keras
GCP
Azure
AWS
Pandas
Unsloth
Transformers
Diffusers
PEFT
Jupyter Notebooks
My Projects

Check out my latest work

I've worked on a variety of projects, from simple websites to complex web applications. Here are a few of my favorites.

Kyro-n1

Kyro-n1

Kyro-n1 is a lightweight and fast reasoning model based on Qwen/Qwen2.5-3B-Instruct.

Python
LLMs
Transformers
Qwen2.5
Unsloth
OdysseyXL

OdysseyXL

Fine-tune of Stability.ai's SDXL text-to-image model for enhanced realism and better image generation

SDXL
Low-Code
Stability.ai
Cloud Training
Diffusers
Python
Open-Neo

Open-Neo

Open-Neo is an Australia based research lab dedicated to advancing open-source AI models

Python
Low-Code
text-to-text
NLP
Research
Athena

Athena

Athena is an LLM that is based on Alibaba's Qwen2.5 family that is designed for STEM tasks and general NLP tasks

Python
Unsloth
text-to-text
image-text-to-text
Multimodal LLM
NLP
Research
Projects/Code

I like building things

  • M

    Maverick Search

    Maverick Search is an open-source AI search engine designed to run locally. Any local model can be used but Athena-3 models are optimised with this code. Maverick Search uses Exa Search API
Contact

Get in Touch

Want to chat? Just shoot me a dm with a direct question on twitter and I'll respond whenever I can. I will ignore all soliciting.