About me

Hello, I'm Saurabh Sawant, a passionate technologist seeking full-time Software Engineering or Machine Learning roles. Having graduated with distinction with a Master of Science in Computer Science, I thrive on crafting innovative solutions using cutting-edge technologies. From leading Generative AI research to tackling challenging projects at Amazon, my journey reflects a commitment to pushing the boundaries of technology and driving impactful innovation.

What I'm doing

  • Software Development

    Professionally, I architect and engineer scalable, efficient, and distributed systems with cutting-edge technologies.

  • Data Science

    I use data analytics, machine learning, and statistical modeling to uncover insights from complex data, guiding data-driven decisions and innovation.

  • Research

    I drive NLP research, using cutting-edge methods to advance language understanding and innovate in text processing and modeling.

  • Photography

    I capture unique perspectives through minimalist photography, and I also explore art illustration and short film creation.

Resume

Education

  1. Arizona State University Tempe, AZ, US

    Master of Science in Computer Science; GPA: 4.0/4.0 Jan, 2022 - Dec, 2023

    Foundations of Algorithms, Data Processing at Scale, Deep Learning, Statistical Machine Learning, Data Mining, Artificial Intelligence

  2. Vellore Institute of Technology Chennai, TN, India

    B.Tech in Computer Science and Engineering; GPA: 8.4/10.0 Jul, 2016 - Sep, 2020

    Data Structures and Algorithms, Object-Oriented Programming, Database and Management Systems, Large Scale Data Systems, Machine Learning, Natural Language Processing, Operating Systems, Applied Linear Algebra

Experience

  1. Amazon Seattle, WA, US

    Software Dev Engineer Intern May, 2023 - Aug, 2023
    • Proposed and led the development of an Interactive Data Extraction CI/CD Pipeline for Amazon Science, reducing waiting and processing time by 15% compared to traditional methods.

    • Used AWS services including Fargate, EC2, and S3 to orchestrate a robust and scalable infrastructure.

    • Developed a Python module to streamline access within Amazon SageMaker, improving accessibility and usability for the Amazon Science team.

  2. Arizona State University Tempe, AZ, US

    Research Assistant (NLP Research) May, 2022 - Dec, 2022
    • Created VarBERT, a constrained masked language model (MLM) that recovers meaningful variable names from decompiled source code, achieving an accuracy of 50.70% and surpassing state-of-the-art models. (IEEE S&P)

    • Developed instruction-learning methods to improve the performance of LLMs such as T5, achieving comparable performance using just 10% of the original dataset. (arXiv)

    • Demonstrated the limitations of large language models (LLMs) such as GPT-3 and T5 by creating a numerical feasibility dataset and experiments showing that GPT-3 reaches an accuracy of just 19%. Also demonstrated the inability of such models to ingest knowledge. (EACL 2023)

  3. Wipro Ltd. Pune, MH, India

    Machine Learning Engineer Sep, 2020 - Oct, 2021
    • Led the development of an ML-driven customer service routing system, reducing incorrect routing by 18% with a hybrid model combining decision tree and neural network algorithms.

    • Reduced processing time by 40% for transforming millions of rows of Mastercard’s secure customer transaction data (terabytes in scale) by engineering a robust ETL pipeline with Apache Spark, SQL, and Apache Hadoop.

    • Developed batch manager jobs on Unix to schedule and automate Apache Spark and SQL scripts, reducing manual work by 24 hours per month and streamlining data processing.

    • Mentored engineers in Apache Spark and Python: established best practices, guided career development, ran customized technical learning sessions for individual engineers, and helped onboard new team members.

  4. Centre for Development of Advanced Computing (C-DAC) Pune, MH, India

    Machine Learning Engineer Intern May, 2019 - Jul, 2019
    • Proposed and implemented a Bi-LSTM named entity recognition (NER) model that extracts relevant features from the conversation to route users to the most relevant customer service professional, reducing incorrect customer routing by 13%.

My skills

Languages
  • Python
  • Rust
  • TypeScript
  • C++
  • Java
  • SQL
  • JavaScript
Frameworks & Libraries
  • AWS Services
  • AWS CDK
  • PyTorch
  • TensorFlow
  • Hugging Face
  • NumPy
  • Pandas
  • Flutter
Tools & OS
  • PostgreSQL
  • Docker
  • Apache Hadoop
  • Apache Spark
  • MySQL
  • Tableau
  • HTML
  • CSS
  • Unity
  • Linux
