Harshvardhan Mestha

I am an undergraduate student at BITS Pilani, Goa Campus where I’m pursuing Bachelor of Engineering in Electronics and Instrumentation with a Minor in Data Science.

I am also a member of SAiDL, Society for Artificial Intelligence and Deep Learning at BITS Goa; we try to inculcate a spirit of AI and DL in the university through open source projects and personalised courses.

I also love all things related to sci-fi,space,cars,aviation. I also draw and you can find some of my art here.

[linkedin] [github] [email] [alt email] [resume]

news

2nd October 2024

My paper with ESML is under review at the 13th International Conference on Learning Representations !

13th August 2024

Joined the Efficient Scalable Machine Learning (ESML) group as a research intern !

30th June 2024

Our blogpost on Graphormer accepted at GRAM Workshop @ ICML 2024.

24th June 2024

My paper with APPCAIR is under review at the 27th International Conference on Discovery Science !

5th June 2024

My paper is under review at ReScience C 2024 !

1st May 2024

Will be working at NeosAlpha Technologies as an intern for summer 2024.

30th April 2024

Selected for Neuromatch Summer School'24 in the Deep Learning Track.

26th March 2024

Working at FOG Technologies as a Computer Vision intern.

14th December 2023

Working at APPCAIR as a student researcher !

1st October 2023

Only person from our batch to get inducted into SAiDL !

projects

State-space models can learn in-context by gradient descent

Working on using a novel SSM (State Space Model) architecture for language modeling, by using SSMs to emulate gradient descent, and exploring the mechanisms by which SSMs perform in-context learning. Under review at ICLR 2025.

Supervisors: Prof. Anand Subramoney (Royal Holloway, University of London)

[preprint]

RadioLM - Radiology Language Model

Working on a novel pseudo replacement to RLHF through prompting which also involves creating a modified model with this pseudo-RLHF and then using it to find a Human-window of LLM explanation understandability. We are testing this theory on Radiology/Med students.

Supervisors: Prof. Ashwin Srinivasan, Prof. Sidong Liu (Macquarie University, Australia), Prof. Tanmay Verlekar

CountCLIP - [Re] Teaching CLIP to Count to Ten

I conducted a reproducibility study of the paper Teaching CLIP to Count to Ten, published by Google Research, in ICCV 2023. I implemented the paper from scratch and collected a specialized dataset to facilitate the training. In addition to this, I carried out further explorations and analysis of the paper, and wrote a paper on my findings which is currently under review at ReScience C 2024.

[preprint] [code] [dataset]

AutoPAC - Automatic Plan and Code Synthesis

An LLM-based pipeline to apply a idea to resolve challenges in an ML pipeline. AutoPAC models a more realistic setting of incremental development of ML pipelines, resolving the issues in a continual fashion.

Supervisors: Prof. Ashwin Srinivasan, Prof. Gautam Shroff (TCS Research), Prof. Tanmay Verlekar

[preprint]

Visualising Image Generation using Stable Diffusion

I implemented the Stable Diffusion paper from scratch (with the help of this tutorial), and added the functionality to animate the image generation process. More animated generations can be found here.

[repo]

Relevant XKCD

I made a website to find the most relevant XKCD comic based on the prompt you type. Embeddings of the comic explanations were made, and the most relevant comics are fetched using a similarity search. The website is a Flask application.

[repo][link]

[more projects]