Shreya Sharma

About Me

Hi! I'm currently pursuing Masters of Science in Computer Science at University of Massachusetts Amherst, with my courses focused on furthering my understanding of Data Science and Systems. I'm passionate about bridging data science, engineering and analytics, building robust pipelines that make insights scalable, interpretable, and genuinely useful.

I like to read and draw on my spare time

I'm always open to new opportunities to learn and explore new concepts through practical implementation to further my understanding of the ever-evolving technical space.

  • Age 24
  • Present Address 1040 N Pleasant St, Amherst, MA
  • University Email ssharma0@umass.edu
  • Personal Email shuureiyashi99@gmail.com
  • Phone +1 413 425 3293
  • LinkedIn/shreya-sharma-31aa86215
  • GitHub@uchuuronin

What I have done

Data Science & Analytics

Exploratory analysis, clustering, anomaly detection, and visual storytelling with pandas, scikit-learn, plotly and machine learning models.

Data Engineering

Worked with tools such as Kafka, Redis, Storm, Pyspark, etc to assist in data analytics workflows on AWS (EMR/S3) and Google Cloud

Data-Driven Reliability

Investigating anomalies and correlated behaviour across systems using statistical and causal reasoning to improve resilience and detection accuracy.

Research & Projects

Applying statistical computing, retrieval-augmented analysis, and structured QA ideas to real-world systems datasets.

Fun Facts

2+ Years

in industry

∞ Curiosity

for learning

Ongoing Growth

via experiences

Journey

Experience

Jan 2023 – Aug 2025
Juniper Networks (Mist) · Bangalore, India

Software Engineer II

  • Expanded RRM anomaly detection with new outlier checks on RRM and AP behavior, enabling real-time identification of poor channels and faulty APs.
  • Developed and deployed a Slackbot integrated into Storm topology for anomaly spike alerts and real-time QA/Engineering visibility.
  • Facilitated live anomaly trend monitoring through an Elasticsearch pipeline and environment dashboards used by QA and Dev teams.
  • Conducted AP health analysis across ~1M+ time-series records using statistical and probabilistic modeling to uncover interference, utilization, and firmware issues.
  • Validated zero-client detection cases by designing statistical thresholds and flagging only performance-impacting anomalies.
Jan 2023 – Jun 2023
Juniper Networks (Mist) · Bangalore, India

Software Engineer Intern

  • Built AP system panic/reboot anomaly framework using EDA + LSTM; uncovered low-memory failure patterns.
  • Automated HTML reporting with Pandas/PySpark; EMR → (later) Airflow uploads to cloud storage.
Jun 2022 – Jul 2022
Andritz · Bangalore, India

Software Development Intern

  • Developed multithreaded C++ TCP/IP client–server for secure machine ↔ control app communication; live checks & fault recovery.
  • Added START/STOP controls, periodic pings, and auto-reconnect for reliability under concurrent load.
  • Prototyped real-time dashboards (C#, AJAX, C3.js) for trainers/engineers.
Jul 2021 – Aug 2021
Nebula Cloud Solutions · Skopje (Remote)

Remote Data Analyst

  • Collected multi-year pollution/weather data (Python, Selenium, Requests); built structured time-series datasets.
  • Created interactive visualizations (Tableau, Plotly, Seaborn) to reveal pollution–weather trends.

Education

2025 – Present
University of Massachusetts Amherst

MS in Computer Science (Data Science)

Fall 2025 courses: Systems for Data Science (COMPSCI 532), Information Retrieval (COMPSCI 646), Statistical Computing (STAT 535).

2019 – 2023
Manipal Institute of Technology

B.Tech, Computer Science & Engineering

Minor: Computer Graphics & Visualization • CGPA: 8.69/10 (~3.7/4).
Coursework: Distributed Systems, Computer Vision, Digital Image Processing, AR/VR, HCI, Software Testing, Engineering Math.

2017 – 2019
Chittagong Grammar School

CIAE A Levels

Results: 2 A*, 1 A, 1 B • Subjects: CS, Mathematics, Physics, Chemistry.

Technical Skills

  • Python
  • Java
  • C++
  • SQL
  • HTML/CSS/JS

Data & ML

  • Pandas
  • PySpark
  • scikit-learn
  • TensorFlow
  • Statistical/Probabilistic Modeling
  • Anomaly Detection

Cloud & Big Data

  • AWS EMR
  • Amazon S3
  • GCP
  • Apache Storm
  • Kafka
  • Airflow
  • Redis
  • Elasticsearch

Visualization & Reporting

  • Plotly
  • Matplotlib
  • Seaborn
  • Tableau
  • C3.JS

Tools & Other

  • Selenium
  • OpenCV
  • Slackbot API
  • Git
  • Figma
  • Unity
  • Blender
  • Jira
  • Confluence

Personal Projects

AP Behaviour Anomaly Detection (Internship Report)
Internship project, attaching only report-presentation done for University for reference. Using LSTM for Anomaly detection based on train/test, as well as Correlation through Mutual Information

AP Behaviour Anomaly Detection (Internship Report)

data-ml
Material Image Classification & Recyclability
Explored recyclable material classification using OpenCV, preprocessing and trained a CNN (TensorFlow) to predict recyclability

Material Image Classification & Recyclability

cv, data-ml
Customer Feedback Sentiment Analyzer
Built NLP pipelines with TF–IDF and LLM-based models to analyze product review sentiment and summarize customer insights.

Customer Feedback Sentiment Analyzer

nlp
Gamification Impact on Learning Outcomes (Research)
Designed a structural equation model to study effect of gamification on self-learning outcomes; Conducted empirical analysis via Duolingo and Google Forms. Research Project under Pranav S Joshi (Assistant Professor) w. Aditya Gunturu and Akshat Taneja

Gamification Impact on Learning Outcomes (Research)

research, hci
Information Retrieval Course Research (Ongoing)
To further improve how queries are processed and documents retreived from a corpus using RAG. Primarily worked on the reranking exploration, setup and implementation. Done as a part of research related coursework for CS646

Information Retrieval Course Research

research, nlp
Reddit Stock Market Analysis (Statistical Computing)
Statistical analysis investigating relationship between Reddit ticker mentions and stock movements. Applied regression, Granger causality tests, and time-series analysis in R. Found Reddit mentions correlate with volatility but not directional returns, particularly for Big Tech stocks. Utilized web scraping, regex-based text processing, and data visualization as a part of STAT535.

Reddit & Stock Market Statistical Analysis

data-ml, research
Cryptocurrency Streaming Pipeline (Systems)
End-to-end distributed streaming system for real-time cryptocurrency analytics for course CS532. My contribution was building WebSocket producers (Coinbase/Binance), Kafka message queue with 4 partitions, stream processor with knobs, and doing load, unit and integrity testing. Project had checkpoint-based recovery and multiprocessing optimization for claim processing workloads.

Cryptocurrency Streaming Data Pipeline

data-ml

Certificates

Coursera

  • 1. Python for Data Science & AI
  • 2. Python Project for Data Science
  • 3. Intro to C# & Unity
Multiple credentials

Coursera Project Network

  • 1. Learn MySQL Fundamentals
  • 2. Machine Learning with Docker
  • 3. Components in Figma
Project certificates

Udemy

  • 1. AI in Digital Marketing
  • 2. Python Programming & Software Design
Course certificates

Workshops & Others

  • 1. Hands-On Arduino (MIT Innovation Centre)
  • 2. AI/ML with R (HT India Labs)
  • 3. Build Your Own Redis (Codecrafters.io)
Workshops

Leadership & Volunteering

IAESTE India

Head, Consular & Member Affairs (National Committee)
2021–2022

IAESTE LC Manipal

Finance-In Charge
2020–2021

Research Society Manipal

Expertise Head, Humanities
2021–2022

ACM–W

Core Committee Member
2020–2022

IDXA Manipal

Founding Board Member
2022–2023

VISION Student Project

AR Subdivision Member
2020–2021

The Duke of Edinburgh Volunteering

Gold Community Service Award
2018

Queen’s Commonwealth Essay Competition

Silver Award, Senior Category
2017

Feel free to reach out!

Amherst, MA

413-425-3293

ssharma0@umass.edu

How Can I Help You?