Docusign brings agreements to life. Over 1.5 million customers and more than a billion people in over 180 countries use Docusign solutions to accelerate the process of doing business and simplify people’s lives. With intelligent agreement management, Docusign unleashes business-critical data that is trapped inside of documents. Until now, this data was disconnected from business systems of record, costing businesses time, money, and opportunity. Using Docusign’s Intelligent Agreement Management platform, companies can create, commit, and manage agreements with solutions created by the #1 company in e-signature and contract lifecycle management (CLM).
Operating a reliable global service requires robust observability and automation. The Observability team builds the real‑time telemetry and AI/ML capabilities that power how engineers measure, visualize, investigate, and improve customer experience—at planetary scale.
This position is an individual contributor role reporting to the Engineering Manager for Observability.
Own end‑to‑end pipelines—from ingestion and feature engineering to training and real‑time/batch inference—for operational time‑series use cases (metrics, logs, events)
Design and operate anomaly‑detection & forecasting services (statistical/ML) that improve SLAs, reduce alert fatigue, and accelerate incident triage
Build internal SDKs, metadata catalogs, and reusable ingestion frameworks that standardize access, enforce governance, and accelerate adoption across product and platform teams
Harden data quality (sanity scoring, validation, drift checks), backfills, and replay strategies; define SLOs for data and models
Partner with Applied Scientists to productionize models with pragmatic algorithms (e.g., tree‑based methods, classical TS) and selectively introduce deep learning where it pays off
Implement CI/CD for data & models (tests, canaries/shadowing, monitoring/evals, safe rollback)
Optimize time‑series storage and compute (e.g., ClickHouse, Postgres/TimescaleDB, columnar stores): partitioning, rollups, retention, and cost controls
Integrate with the observability stack (OTel for signals; dashboards/alerts) and collaborate with SRE/infra to ensure multi‑tenant performance and resilience
Explore LLM‑assisted on‑call (RAG over logs/runbooks) to improve diagnosis and guidance; manage prompt safety, evals, and latency budgets
Hybrid:
Employee divides their time between in-office and remote work. Access to an office location is required. (Frequency: minimum 2 days per week; may vary by team, but there will be a weekly in-office expectation.)
Positions at Docusign are assigned a job designation of either In Office, Hybrid or Remote and are specific to the role/job. Preferred job designations are not guaranteed when changing positions within Docusign. Docusign reserves the right to change a position's job designation depending on business needs and as permitted by local law.
- 12+ years of software engineering, with deep Python and SQL in production
- Hands‑on experience building real‑time and batch pipelines with stringent SLAs and data quality controls
- Proven experience in time‑series analysis, anomaly detection, and forecasting for operational systems
- Experience deploying containerized services (Docker), Linux fundamentals, and CI/CD
- Proficiency with at least one time‑series/analytical store (ClickHouse, Postgres/TimescaleDB, columnar stores), plus caching/NoSQL where appropriate
- Experience with workflow orchestration (Prefect, Airflow, or Dagster)
- Streaming platforms (Kafka/Redpanda/Pulsar) or equivalent messaging; schema management and idempotent/exactly‑once strategies
- Model serving and monitoring (custom FastAPI/Flask services, MLflow, KServe/Seldon); drift detection; shadow/canary rollouts
- Familiarity with observability tooling (OpenTelemetry, Prometheus, Grafana) and alerting best practices (SLOs, MTTR/MTTA)
- Exposure to LLMs and Hugging Face; interest in applying LLMs to ops guidance (RAG over telemetry/runbooks)
- Distributed systems fundamentals; cloud experience (GCP/AWS/Azure) and IaC
- Kubernetes experience or willingness to ramp quickly
Docusign is committed to building trust and making the world more agreeable for our employees, customers and the communities in which we live and work. You can count on us to listen, be honest, and try our best to do what’s right, every day. At Docusign, everything is equal.
We each have a responsibility to ensure every team member has an equal opportunity to succeed, to be heard, to exchange ideas openly, to build lasting relationships, and to do the work of their life. Best of all, you will be able to feel deep pride in the work you do, because your contribution helps us make the world better than we found it. And for that, you’ll be loved by us, our customers, and the world in which we live.
Docusign is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. If you need such an accommodation, or a religious accommodation, during the application process, please contact us at accommodations@docusign.com.
If you experience any issues, concerns, or technical difficulties during the application process, please get in touch with our Talent organization at taops@docusign.com for assistance.