- Ph.D. in Computer Science, Lehigh University, Aug 2020 - May 2025 (expected)
- Doctor of philosophy: I enable machines to expand their visual knowledge by interacting with and learning directly from people.
- Advisor: Dr. Brian D. Davison
- Focus area: Machine Learning, Data Science, Information Retrieval
- M.S. in Computer Science, Lehigh University, Aug 2018 - May 2020
- B.E. in Computer Science, Rajiv Gandhi Proudyogiki Vishwavidyalaya, Aug 2013 - June 2018
Incoming Ph.D. Summer Intern, Lawrence Berkeley National Lab, ESnet
- Peer Mentor, Lehigh University (NSF-REU) 2020 (CNS-1757787)
- Mentored and closely guided 15 NSF-REU interns on their respective site projects.
- Duties included weekly one-on-one discussions with each intern to understand their setbacks and help them with their code and other low-level academic issues.
- Research Assistant, Resilience Research Group for SARS-CoV-2
- Research Intern, Lawrence Berkeley National Lab, NERSC (link)
- Supervisor: Dr. Brian Austin
- Developed scripts to fetch and analyze terabytes of data from the SLURM scheduler.
- Analyzed and estimated real-time queues in the scheduler to optimize the policies for incoming jobs.
- Developed three real-time policies that potentially improved the allocation procedure.
- Job Cancellation
- Job Pausing
- Machine Learning Intern, Persistent Systems Ltd (link)
- Developed a facial recognition and verification system using Google’s FaceNet research as the baseline.
- Added OpenCV-based features on top of it to differentiate between 3-D and 2-D images.
- Designed a purely browser-based, RSA-compliant module to work with FIDO keys.
- Memory Defense: More Robust Classification via a Memory-Masking Autoencoder (Python, PyTorch, Advertorch) (link)
- We developed a robust autoencoder with one-hot memory masking to mitigate adversarial attacks.
- The proximity approximation model can retrieve an image’s relevant memory features and reconstruct it with a repaired label.
- The enhanced deep neural architecture significantly improved the robustness of a DNN on an image classification task.
- Face-Mask Detection on a real-world Webcam dataset (Python, Bash, ffmpeg, PyTorch, COCO Annotator)
- Collected more than 900 GB of public webcam data by periodically capturing image frames from over 80 webcams across the United States.
- Applied semi-automated labeling with COCO Annotator to develop ground-truth labels.
- Re-implemented 4 state-of-the-art face detection algorithms for face detection and face mask detection to analyze their effectiveness on a real-world dataset.
- Reported face mask usage across the United States from Jun 23, 2020 to Feb 10, 2020.
- Sequence Generative Adversarial Nets with Policy Gradient (Python, PyTorch) (link)
- Seq-GAN models the data generator as a stochastic policy in reinforcement learning, which bypasses the generator differentiation problem that arises with discrete sequence outputs.
- The RL reward signal comes from the GAN discriminator judged on a complete sequence, and is passed back to the intermediate state-action steps using Monte Carlo search.
- Facial Recognition and Verification System (Python) (link)
- Real-time recognition system focused on mobile devices.
- Overcame the challenge of keeping a low false-positive rate by developing an approach that learns directly from 128-D embeddings in a Euclidean space.
Small Academic Projects
- Image & Text Annotation using Reinforcement Learning (Python) (link)
- In this team project, I built a deep learning model with Inception V3 as the base to extract features from the images and pass the full feature vector to the text annotation model, which drives a multi-armed bandit algorithm. The main challenge was the very small amount of labeled data.
- Predicting the attractiveness of the list of products on Amazon (Python) (link)
- The quality of a product listing is crucial for improving search relevance and gaining customer attention. After cleaning the data and engineering features, I implemented a neural network model using Keras text processing to learn from the Amazon dataset.
- Two-Phase Commit Replicated Hash Table (C++, Bash, Amazon EC2) (link)
- Implemented a hash table that ensures atomic distributed transactions in the system. It replicates every operation to a separate backup node for use in the recovery phase.
- Twitter Trend Analysis to create a high-frequency contextual word dictionary (C++, MongoDB, HTML5) (link)
- Analyzed Twitter data to extract a dictionary of the most frequently used words for a political campaign. The application is implemented in C++ with a MongoDB back end.
- Languages - Python, C++, Bash, Scala
- Tools & Libraries - PyTorch, TensorFlow, OpenCV, dlib, Boost-C++, CMake, scikit-learn, git, LaTeX, PySpark
Achievements and Honors
- Awarded the Outstanding Project Award by Cognizant for Facial Recognition with Deep Neural Network (2016-17)
- Best Project Award by the Department of Computer Science in RGPV university for ’Physical Intrusion Detection System’ (2015-16)
- Ranked among top 1 percentile in TESTimony’16 organised by Tata Consultancy Services. (2015)
- Awarded by H.C.Verma (Experimental Physicist): Winner of the National Level SCEECS’16: Quiz Competition organised by National Institute of Technology, Bhopal (MANIT). (2015)
- Awarded a trophy by the Central Board of Secondary Education (CBSE) for performance in the English subject (2013)
- Presented an abstract paper on the Shellshock vulnerability at a national-level technical symposium in Bhopal, explaining how the attack vector works, the risks involved, and how to mitigate them. This explanation of the Shellshock CVE was published in the national conference magazine of the LNCT Group of Colleges, Bhopal. (2014)
- All India Rank 598 in International Maths-Science Olympiads (2007)
- Deep Learning School 2019 at Lawrence Berkeley National Laboratory
- Introduction in Data Structure Programming: IIT Madras - NPTEL National Programme on Technology Enhanced Learning
- Linux FoundationX: edX - 93% Verify
- Introduction to Programming Using Python: Massachusetts Institute of Technology (MIT) Verify
- Introduction to C++: Microsoft - 98% Verify
Big Data Analytics, Advance Programming Techniques, Introduction to Data Mining, Machine Learning MOOC, Introduction to Artificial Intelligence, Analysis and Algorithm Design, Data Structures, Object Or Computation, Discrete Mathematics, Computer System Organization, Database Management System, Computer Graphics and Multimedia, Computer Network, Software Engineering, Compiler Design, Soft-Computing.
- Vice President of the Lehigh University Cryptography Club.
- Broke the school’s 10-year standing record in triple jump in the 2012-2013 session.
- Won first prize in the JSSC Basketball Tournament.
- Won first prize in the 200 m event at the inter-district athletics meet.