Hi, I'm Aayan 👋
Artifical Intelligence/Machine Learning Developer & Researcher
AM

About

Hey there! I'm Aayan and I work on building AI projects, from finetuning models like OdysseyXL to developing systems such as Maverick Search. Whether its deep learning, experimenting with model optimization, or creating something entirely new, I'm always exploring something. When I'm not coding or training models, you'll probably find me watching Formula 1.

Education/Certificates

H

Harvard University - CS50x: Introduction to Computer Science

2025 - Present
This is CS50x , Harvard University's introduction to the intellectual enterprises of computer science and the art of programming for majors and non-majors alike, with or without prior programming experience. An entry-level course taught by David J. Malan, CS50x teaches students how to think algorithmically and solve problems efficiently. Topics include abstraction, algorithms, data structures, encapsulation, resource management, security, software engineering, and web development. Languages include C, Python, SQL, and JavaScript plus CSS and HTML. Problem sets inspired by real-world domains of biology, cryptography, finance, forensics, and gaming.
H

Hugging Face - LLM Course

2025 - Present
The Hugging Face LLM Course is a practical, hands-on introduction to working with large language models (LLMs) using the Hugging Face ecosystem. It covers key concepts like transformers, tokenization, model inference with pipelines, fine-tuning on custom datasets, evaluation, and deployment. Designed for developers and ML practitioners with basic Python skills, the course teaches how to leverage state-of-the-art models for tasks such as summarization, classification, and translation. Learners get to explore the Hub, train and share models, and optimize inference with tools like Text Generation Inference (TGI) and accelerate, all through interactive Colab notebooks and real-world examples.
T

TAFE NSW - Introduction to Artificial Intelligence

2025 - 2025
Completed a 2.5-hour self-paced online Microskill course introducing the fundamentals of Artificial Intelligence, with no prior technical knowledge required. Gained foundational understanding of how AI learns from data, explored real-world applications across various industries, learned key AI terminology, and received insights from industry professionals on starting a career in AI. The course also addressed common myths and misconceptions surrounding AI. Successfully completed all modules and assessments to earn a certificate of completion.
C

CodeSignal - Building Neural Networks with PyTorch

2024 - 2025
Master PyTorch with this learning path, designed for those experienced in Python and machine learning. From tensor basics to advanced modeling, it includes practical exercises focused on real-world datasets, such as the wine dataset, enhancing your deep learning skills through PyTorch.
S

Sololearn - Python Developer

2024 - 2025
Python is the world's fastest growing programming language is easy to read, learn and code. You'll learn to build interactive programs and automate your tasks, analyze and visualize even the most complex data and create AI and machine learning models. No previous coding experience needed.

Skills

Python
Vercel
NumPy
PyTorch
TensorFlow
scikit-learn
Docker
Keras
GCP
Azure
AWS
Pandas
Unsloth
Transformers
Diffusers
PEFT
Jupyter Notebooks
My Projects

Check out my latest work

I've worked on a variety of projects, from simple websites to complex web applications. Here are a few of my favorites.

OdysseyXL

OdysseyXL

Fine-tune of Stability.ai's SDXL text-to-image model for enhanced realism and better image generation

SDXL
Low-Code
Stability.ai
Cloud Training
Diffusers
Python
Open-Neo

Open-Neo

Open-Neo is an Australia based research lab dedicated to advancing open-source AI models

Python
Low-Code
text-to-text
NLP
Research
NoemaCoder

NoemaCoder

NoemaCoder is a SOTA coding LLM that excels in many enviourments. Based on Bytedance-Seed's SeedCoder-8B, providing an excelent base which outperforms models such as Alibaba's QwQ-32B and Deepseek AI's Deepseek-R1

Python
Unsloth
text-to-text
Code
NLP
Research
Llama
Projects/Publications

I like building things

  • V

    Vera-V1: Enhancing Multilingual Language Models with Group Relative Policy Optimisation (GRPO)

    This is a research project which I led the development of the Vera model family. The purpose of this research project was to analyse how we can improve non-reasoning multilingual LLMs through reinforcement specifically Group Relative Policy Optimisation (GRPO).
  • M

    Maverick Search

    Maverick Search is an open-source AI search engine designed to run locally. Any local model can be used through Ollama Maverick Search uses Exa Search API
Contact

Get in Touch

Want to chat? Just shoot me a dm with a direct question on twitter and I'll respond whenever I can. I will ignore all soliciting.