Hi, I’m Grace.
I work on site reliability and incident operations at Microsoft Azure Engineering Operations (EngOps), previously PM on Datadog’s LLM Observability product. I’m interested in distributed systems, high performance computing, and infrastructure.



























