Sitemap
A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.
Pages
Posts
Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit _config.yml and set future: false.
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
portfolio
Data Science Internship
JIO Data Science Platform, Reliance JIO
Defence for Face-morphing Adversarial Attacks on Facial Recognition Systems
Course project - CS726 : Advanced Machine Learning - Guide : Prof. Sunita Sarawagi
Toxicity Removal in Large Language Models
Course project - CS772 : Deep Learning for Natural Language Processing - Guide : Prof. Pushpak Bhattacharyya
Multi-armed Bandits and Markov Decision Processes for Gaming
Course project - CS747 : Foundations in Intelligent and Learning Agents (Reinforcement Learning) - Guide : Prof. Shivaram Kalyanakrishnan
Neuromorphic Computing and Spiking Neural Networks for Real Time Learning
Course project - EE746 : Neuromorphic engineering - Guide : Prof. Udayan Ganguly
Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays
Course project - DH602 : Machine Learning and Statistical Methods in Healthcare - Guide : Prof. Kshitij Jadhav
Transformer based model for Seizure detection in EEG data
Course project - DH302 : Public Health Informatics - Guide : Prof. Kshitij Jadhav - Secured an AP grade
Facial Feature Detection using the Fastai Library
Course project - DS303 : Introduction to Machine Learning - Guide : Prof. Biplab Banerjee
Automated Recognition, Processing and Sentiment Analysis of Speech
Course project - CS753 : Automatic Speech Recognition - Guide : Prof. Preethi Jyothi - Best Presentation Award
Wavelet transforms for image super-resolution and restoration
Course project - EE610 : Image Processing - Guide : Prof. Amit Sethi - Secured an AP grade
Jigsaw puzzle solver using Computer Vision Techniques
Course project - CS763 : Computer Vision - Guide : Prof. Sharat Chandran
Cardiovascular Disease Prediction Using Data Science Techniques
Course project - DS302 : Programming for Data Science - Guide : Prof. Amit Sethi
Neural Style Transfer for Text and Chat - Season of Code Project
Completed a Summer Project under the Web and Coding club
Reflow Oven for Soldering SMD Components on PCBs
Course project - EE344 : Electronic Design Lab - Guide : Prof. Siddharth Tallur - Best Project Award
IITB RISC ’22 - A Multi-cycle Processor
Course project - EE309 : Microprocessors - Guide : Prof. Virendra Singh
Data Structures and Algorithms - Summer of Science Project
Completed a study report and video presentation under the Math and Physics club
LASSO - A Gaming Program
Course project - CS101 : Computer Programming and Utilization - Guide : Prof. Bhaskaran Raman
Fine Grained Classification and Image Denoising using CNNs
Course project - GNR638 : Machine Learning for Remote Sensing - Guide : Prof. Biplab Banerjee
Analysis of Linear Complexity Distribution of Functions
Course project - EE793 : Topics in Cryptology - Guide : Prof. Virendra Sule
Socket Programming
Course project - CS224 : Computer Networks - Guide : Prof. Vinay Ribeiro
Digital Filter Design and ECG analysis
Course project - EE338 : Digital Signal Processing - Guide : Prof. Vikram Gadre
Speech processing and recognition
Course project - EE678 : Speech Processing - Guide : Prof. Preeti Rao
Nuclei Instance Segmentation and Classification for Histopathology Images
Bachelor’s Project-I (Collaboration with TATA Cancer Research Hospital) at the Medical Deep Learning and Artificial Intelligence Lab, IIT Bombay, Guide: Prof. Amit Sethi
CT Reconstruction from Ultrasound Images of Breast Cancer using GANs
Research Assistant (Collaboration with TATA Cancer Research Hospital) at the Medical Deep Learning and Artificial Intelligence Lab, IIT Bombay, Guide: Prof. Amit Sethi
Computer Vision Techniques for Vocal Fold Surgery Assistance
Research Internship (MITACS GRI Award) at the Medical Computer Vision and Robotics Lab, University of Toronto, Onsite: May’23 - Jul’23, Guide: Prof. Lueder Kahrs
Genomics-based Survival Analysis for Lung Cancer using Multimodal Data
Bachelor’s Project-II (Collaboration with TATA Cancer Research Hospital) at the Medical Deep Learning and Artificial Intelligence Lab, IIT Bombay, Guide: Prof. Amit Sethi
Hallucination Detection and Mitigation in Large Language Models
Research Assistant at the Crowd Dynamics Lab, UIUC, Guide: Prof. Hari Sundaram
Multilingual Automatic Speech Recognition for Low-resource Languages
Master’s Thesis-I (Nationwide project - Bhashini, NLTM and the Amazon IITB AI-ML Initiative) at the Computational Speech and Language Technologies Lab, IIT Bombay, Guides: Prof. Preethi Jyothi, Prof. Pushpak Bhattacharyya
Large Language Model-based metric for Automatic Speech Recognition
Master’s Thesis-I (Nationwide project - Bhashini, NLTM) at the Computational Speech and Language Technologies Lab, IIT Bombay, Guide: Prof. Preethi Jyothi
Low-resource and Dialectal Speech Generation
Master’s Thesis-II (Nationwide project - BharatGen) at the Computational Speech and Language Technologies Lab, IIT Bombay, Guides: Prof. Preethi Jyothi, Prof. Ganesh Ramakrishnan
publications
Artificial Intelligence-based Eosinophil Count in Gastrointestinal Tract Biopsy
Poster presented at the American Gastroenterological Association meeting (DDW), Chicago, and published in the Gastroenterology journal, 2023
An eosinophil detection model, built on the UNet architecture, that overcomes severe class imbalance.
Recommended citation: Shah H.C., Amarpurkar A.D., Jacob T., Parulekar A.M. and Sethi A. (2023). EP178 ARTIFICIAL INTELLIGENCE BASED EOSINOPHIL COUNT IN GASTROINTESTINAL TRACT BIOPSY. Gastroenterology, 164(6), pp.S-1229.
Download Paper | Download Slides
Towards improving breast cancer detection through multi-modal image generation
Published in the Ultrasonics journal, 2023
Interconversion of CT scans and ultrasounds using wave interference patterns, GANs and Fourier domain adaptation.
Recommended citation: Almahfouz Nasser S, Sharma A, Saraf A, Parulekar A, Haria P, Sethi A. Towards improving breast cancer detection through multi-modal image generation. Ultrasonics. 2025 Sep;153:107655. doi: 10.1016/j.ultras.2025.107655. Epub 2025 Apr 15. PMID: 40262439.
Download Paper
Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification
Published and presented in Bioimaging (BIOSTEC), Rome, 2024
A novel loss function and training technique that can be integrated with a multitude of architectures, for consolidating class labels of different nuclei segmentation and classification datasets.
Recommended citation: Parulekar A., Kanwat U., Gupta R., Chippa M., Jacob T., Bameta T., Rane S. and Sethi A. (2024). Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification. In Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOIMAGING; ISBN 978-989-758-688-0, SciTePress, pages 281-288. DOI: 10.5220/0012380800003657
Download Paper | Download Slides
A Computer Vision Pipeline for Laryngoscopic Image Standardization through Histogram Matching
Poster presented in the 145th American Laryngological Association meet (COSM), Chicago, 2024
A preprocessing pipeline for laryngoscopic videos that includes removal of unusable frames, illumination correction, specularity removal and finally color transfer to a target intensity distribution.
Recommended citation: Parulekar A., Wiercigroch J., Kahrs L. A. and Lin R. J. (2024). A Computer Vision Pipeline for Laryngoscopic Image Standardization through Histogram Matching.
Download Slides
Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR
Published at the 4th Multilingual Representation Learning Workshop, EMNLP 2024, 2024
Combining computationally efficient techniques like speech-based parameter-efficient finetuning and text-only adaptation to improve automatic speech recognition of low-resource languages using multimodal multilingual models.
Recommended citation: Abhishek Gupta, Amruta Parulekar, Sameep Chattopadhyay, and Preethi Jyothi. 2024. Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR. In Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL 2024), pages 175–185, Miami, Florida, USA. Association for Computational Linguistics.
Download Paper | Download Slides
PathoGen-X: A Cross-Modal Genomic Feature Trans-Align Network for Enhanced Survival Prediction from Histopathology Images
Published at IEEE ISBI 2025 (International Symposium on Biomedical Imaging), 2025
Developed PathoGen-X, a transformer-based framework that translates histopathology image features into the genomic feature space for improved survival prediction without requiring paired genomic data at testing.
Recommended citation: A. Krishna, N. C. Kurian, A. Patil, A. Parulekar, P. J. P and A. Sethi, "Pathogen-X: A Cross-Modal Genomic Feature Trans-Align Network for Enhanced Survival Prediction from Histopathology Images," 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI), Houston, TX, USA, 2025, pp. 1-4, doi: 10.1109/ISBI60581.2025.10981028.
Download Paper | Download Slides
AMPS: ASR with Multimodal Paraphrase Supervision
Published at NAACL 2025 (Main conference), 2025
Integrated paraphrase supervision in a multimodal pipeline to improve Automatic Speech Recognition for spontaneous and disfluent speech.
Recommended citation: Abhishek Gupta, Amruta Parulekar, Sameep Chattopadhyay, and Preethi Jyothi. 2025. AMPS: ASR with Multimodal Paraphrase Supervision. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 404–413, Albuquerque, New Mexico. Association for Computational Linguistics.
Download Paper | Download Slides
LASER: An LLM-based ASR Scoring and Evaluation Rubric
Published at EMNLP 2025 (Main conference), 2025
LASER is an LLM-based ASR evaluation metric that aligns closely with human judgments by capturing linguistic nuances across Indian languages better than traditional metrics.
Recommended citation: Amruta Parulekar and Preethi Jyothi. 2025. LASER: An LLM-based ASR Scoring and Evaluation Rubric. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 24773–24782, Suzhou, China. Association for Computational Linguistics.
Download Paper | Download Slides
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Undergraduate Teaching Assistant - BB101 - Biology
Semester : Autumn 2021, IIT Bombay, Instructors : Prof. Ambarish Kunwar and Prof. Hari Verma, 2021
Facilitating smooth course organization, grading papers, mentoring students, conducting tutorials and help sessions
Undergraduate Teaching Assistant - ME119 - Engineering Graphics and Drawing
Semester : Spring 2022, IIT Bombay, Instructor : Prof. Sushil Mishra, 2022
Facilitating smooth course organization, grading papers, mentoring students, conducting tutorials and help sessions
Undergraduate Teaching Assistant - CS753 - Automatic Speech Recognition
Semester : Spring 2024, IIT Bombay, Instructor : Prof. Preethi Jyothi, 2024
Facilitating smooth course organization, grading papers, mentoring students, conducting tutorials and help sessions
Undergraduate Teaching Assistant - CS725 - Foundations of Machine Learning
Semester : Autumn 2024, IIT Bombay, Instructor : Prof. Preethi Jyothi, 2024
Facilitating smooth course organization, grading papers, mentoring students, conducting tutorials and help sessions
