![](assets/dp2.jpg)
Harshvardhan Mestha
I am an undergraduate student at BITS Pilani, Goa Campus, where I’m pursuing a Bachelor of Engineering in Electronics and Instrumentation with a Minor in Data Science.
I am also a member of SAiDL, Society for Artificial Intelligence and Deep Learning at BITS Goa; we try to inculcate a spirit of AI and DL in the university through open source projects and personalised courses.
I also love all things related to sci-fi, space, cars, and aviation. I also draw, and you can find some of my art here.
news
2nd October 2024
My paper with ESML is under review at the 13th International Conference on Learning Representations!
13th August 2024
Joined the Efficient Scalable Machine Learning (ESML) group as a research intern!
30th June 2024
Our blog post on Graphormer was accepted at the GRAM Workshop @ ICML 2024.
24th June 2024
My paper with APPCAIR is under review at the 27th International Conference on Discovery Science!
5th June 2024
My paper is under review at ReScience C 2024!
1st May 2024
Will be working at NeosAlpha Technologies as an intern for summer 2024.
30th April 2024
Selected for Neuromatch Summer School'24 in the Deep Learning Track.
26th March 2024
Working at FOG Technologies as a Computer Vision intern.
14th December 2023
Working at APPCAIR as a student researcher!
1st October 2023
Only person from our batch to get inducted into SAiDL!
projects
![](assets/demos/gdssm/gdssm.png)
State-space models can learn in-context by gradient descent
Working on a novel SSM (State Space Model) architecture for language modeling that uses SSMs to emulate gradient descent, and exploring the mechanisms by which SSMs perform in-context learning.
Under review at ICLR 2025.
Supervisor: Prof. Anand Subramoney (Royal Holloway, University of London)
[preprint]
![](assets/demos/radiolm/radiolm.gif)
RadioLM - Radiology Language Model
Working on a novel, prompting-based pseudo-replacement for RLHF. This also involves creating a modified model with this pseudo-RLHF and then using it to find a human window of LLM explanation understandability. We are testing this theory with radiology and medical students.
Supervisors: Prof. Ashwin Srinivasan, Prof. Sidong Liu (Macquarie University, Australia), Prof. Tanmay Verlekar
![](assets/demos/countclip/sample.gif)
CountCLIP - [Re] Teaching CLIP to Count to Ten
I conducted a reproducibility study of the paper Teaching CLIP to Count to Ten, published by Google Research at ICCV 2023.
I implemented the paper from scratch and collected a specialized dataset to facilitate the training.
In addition, I carried out further exploration and analysis of the paper, and wrote up my findings in a paper that is currently under review at ReScience C 2024.
![](assets/demos/autopac/autopac.png)
AutoPAC - Automatic Plan and Code Synthesis
An LLM-based pipeline to apply an idea to resolve challenges in an ML pipeline. AutoPAC models a more realistic setting of incremental development of ML pipelines, resolving issues in a continual fashion.
Supervisors: Prof. Ashwin Srinivasan, Prof. Gautam Shroff (TCS Research), Prof. Tanmay Verlekar
[preprint]
![](assets/demos/sd/cherry_blossom.gif)
Visualising Image Generation using Stable Diffusion
I implemented the Stable Diffusion paper from scratch (with the help of this tutorial) and added the functionality to animate the image generation process. More animated generations can be found here.
[repo]
![](assets/demos/xkcd.gif)