Hi, I’m Grace.

I work on reliability and incident management at Microsoft Azure Engineering Operations (EngOps), previously PM on Datadog’s LLM Observability product. I’m interested in distributed systems, high performance computing, and infrastructure.
Grace Gong

Experience

Microsoft 2024 - Present
Reliability Engineer 2026 - Present
Azure Engineering Operations
Product Manager 2024 - 2026
Azure Data · Power BI
Product Manager Intern Datadog Winter 2024
Software Engineer Intern Ontario Teachers Pension Plan Fall 2023
Enterprise Architecture
Software Engineer Intern GoDaddy Summer 2023
Domain Auctions UI
Program Manager Intern Microsoft Fall 2022 · Winter 2023
Finance & Compliance
BI Engineer Intern Scotiabank Summer 2022
Trade Operations
Data Analyst Intern Loblaw Summer 2021
Specialty Health Network
Researcher UNESCO 2020 - 2022
Media & Technology
Founder, Executive Director Youth180° 2019 - 2024
Youth Leadership Non-Profit
Business Operations Manager Local Business 2014 - 2022

Projects

LLM Training on TPUs with JAX & vLLM
LLM Training on TPUs with JAX & vLLM 2026
Fine-tuned a domain-specific LLM on TPUs using JAX & deployed via vLLM for inference.
JAX·vLLM·TPU·LLM·Python
Deprecation of Legacy Excel & CSV Import
Product on the Excel & CSV Import experience in Power BI Service & modernization plan.
Power BI·Azure
Power BI MCP Agentic Development
Built Agentic Development in Power BI Workflow & Demo. Featured at FABCON 2026.
Power BI·MCP·Agentic AI·Azure
Power BI Desktop Power BI Desktop hover
Product on Power BI Desktop. Power BI has 30M+ Monthly Active Users. Presented at GHC 2025.
Power BI·Azure
TMDL Visual Studio Code Extension
Product on the TMDL Visual Studio Code Extension with 45K+ installs.
Power BI·VS Code·TMDL·Azure
Semantic Model Refresh Templates in Power BI
Product on the Fabric Data Pipelines integration in Power BI. Featured at FABCON 2025.
Power BI·Fabric·Azure
ML/LLM Observability ML/LLM Observability hover
Product on Machine Learning / Large Language Model Observability.
LLM·Observability·Python·Datadog
Deep Learning Project | RBC Borealis AI
Deep learning model for earthquake detection & research paper for RBC's Borealis AI institute.
TensorFlow·PyTorch·Azure·Nvidia
Google Cloud Skillsboost Review
PRD detailing research-driven methods to improve user engagement for GCP.
Product Management·Design·GCP
C-Horse Language
A custom programming language designed for children, featuring 'Recipe' based functions and simple syntax.
C++
RippleChat | Google Software Product Sprint
A mental health support chatbot.
Dialogflow·HTML/CSS·Javascript·Google Cloud
MLH Prep Fellowship
Weather application and other features built with team.
Ruby·Jekyll·React·Netlify
Eye Assist | SheHacks 'Architect Better Future'
Assistive tech app using OpenCV & eye-tracking to empower users with limited mobility.
Flutter·OpenCV·Computer Vision
SpeechAssist | 2nd Place Overall
Mobile interpretation app for the visually & hearing impaired using real-time TensorFlow.
Flutter·Tensorflow·Mobile Development
Musio | WaffleHacks Best Art & Music
Music collaboration platform using WebRTC for real-time interaction & CockroachDB for data storage.
React.js·WebRTC·Node.js·CockroachDB
Futerview | 3rd Overall & Best Use of AI
Interview prep site for marginalized communities via AI-driven feedback & accessibility features.
React·AssemblyAI·Design
Robota | 3rd Overall
Job-matching platform to assist displaced civilians in finding immediate work opportunities.
Figma·Product Management·Design
Cryptogal | Best Finance Hack
Financial analysis dashboard with real-time insights & data visualization for crypto trends.
Python·Pandas·APIs
Care Connect | Best Accessibility Hack
Service-matching platform connecting users with local non-profits via location and category filtering.
PostgreSQL·Express·React·Node.js
F3stival | Best Domain Name
Exploring the metaverse through a specialized concert platform built on the DeSo protocol.
Design·Web Development·React
Moodji | Best Domain Name
Mental health tracker that identifies emotional patterns and provides actionable wellness tips.
Dart·Ruby
UNESCO Research Pt.1
Research publication on youth-focused policy & post-COVID recovery solutions for UNESCO.
Python·Data Analysis·Research
UNESCO Research Pt.2
Followup research publication to understand youth realities and advance concrete solutions.
Python·Data Analysis·Research
mEco | Equity & Sustainability
iOS platform routing users to local businesses & offering sustainability tools for entrepreneurs.
iOS / Swift·Google Maps API
PinkRibbons | Girls in Tech
Mobile application improving access to breast cancer preventive care specifically for women of color.
Java·Android
Northstar | Hack it together
Personalized discovery engine for finding curated books, movies, and music recommendations.
React·Google Cloud·APIs
Youth180 | Non-Profit Community
Website for my non-profit dedicated to youth empowerment & community resources.
HTML/CSS·Javascript·Google Cloud Platform

Education

Master of Science, Computer Science

Georgia Institute Of Technology

Master of Science, Management

University of Illinois Urbana-Champaign

Bachelor of Science, Computer Science

Western University
Venture Capital University · Berkeley Executive Education
Dale Carnegie Training · University of Central Missouri
Code in Place Section Leader · Stanford University