Mechanistic Interpretability of LLMs Leveraging Sparse Autoencoders to understand the learning process of an LLM 2025-02-01
MQA Adapt - Adapting MQA for efficient Inference Layerwise Adaptive Multi-Query Attention for efficient inference 2024-12-01
Improving Mini BERT’s predictions using Multilingual Knowledge Distillation Improving mini BERT's performance on Hindi COVID Fake News pred without using Parallel Finetuning data 2021-02-01
Molecular VAE PyTorch implementation of the paper "Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules" 2021-01-01
Neural Question Generation This is an attempt to compare and test various techniques used in seq2seq modelling, on a question generation task 2020-02-01