Job Description
Are you ready to make an impact at DTCC? Do you want to work on innovative projects, collaborate with a dynamic and supportive team, and receive investment in your professional development? At DTCC, we are at the forefront of innovation in the financial markets. We are committed to helping our employees grow and succeed. We believe that you have the skills and drive to make a real impact. We foster a thriving internal community and are committed to creating a workplace that looks like the world that we serve.

The Information Technology group delivers secure, reliable technology solutions that enable DTCC to be the trusted infrastructure of the global capital markets. The team delivers high-quality information through activities that include development of essential, building infrastructure capabilities to meet client needs and implementing data standards and governance.

Pay and Benefits:
Competitive compensation, including base pay and annual incentive
Comprehensive health and life insurance and well-being benefits, based on location
Pension / Retirement benefits
Paid Time Off and Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being
DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays, and a third day unique to each team or employee)

The Impact you will have in this role:
As a key member of the Reliability Architecture organization, the Director of Enterprise Observability Architecture provides strategic leadership for enterprise-wide observability initiatives, ensuring DTCC platforms and applications operate with regulatory-grade visibility, resilience, and operational continuity.
This role defines the north-star observability vision, influences architectural direction, embeds telemetry and resilience into modernization programs, and partners with senior stakeholders across engineering, infrastructure, SRE, security, risk, and business operations. Aligned to DTCC's mission of delivering secure and reliable market infrastructure, this role ensures observability capabilities (metrics, logs, traces, events, dashboards, data health, and automated remediation) are consistently designed, adopted, and governed across the enterprise.

Your Primary Responsibilities:
Shape and champion DTCC's enterprise observability strategy, ensuring alignment with operational resilience, business continuity, and regulatory expectations
Define multi-year roadmaps for observability modernization, including OpenTelemetry adoption, enhanced signal correlation, and AIOps enablement
Establish enterprise-wide architectural standards, patterns, and controls for telemetry, monitoring, alerting, visualization, and retention
Drive platform-engineering approaches that deliver observability as a scalable, self-service capability for application and infrastructure teams
Ensure all critical production services are instrumented for real-time visibility that connects technical health to business impact
Influence senior leadership through clear communication of observability risks, maturity, and strategic investment options
Integrate data observability into analytics ecosystems to support regulatory reporting, risk analytics, and client-impact transparency
Guide engineering teams in embedding observability throughout the SDLC, including NFR testing, architecture reviews, and operational readiness
Lead the design of event-correlation and alerting frameworks that reduce noise, accelerate incident triage, and enable automated remediation
Define enterprise dashboards that provide 360-degree visibility into service reliability, transaction flows, and business-processing health
Maintain an enterprise observability architecture covering metrics, logs, traces, events, RUM, data pipelines, and telemetry governance
Author policies, standards, and procedures for monitoring, alerting, logging, visualization, and retention
Partner with platform, cloud, and infrastructure engineering to integrate observability into modernization and cloud-adoption strategies
Present architecture strategies and program health to senior technology and business leaders
Lead enterprise assessments, failure-mode analysis, chaos engineering practices, and post-incident improvement cycles
Translate telemetry insights into business-level narratives that inform risk, resilience, and operational decision-making

NOTE: The Primary Responsibilities of this role are not limited to the details above.

Qualifications:
Minimum 10 years of related experience
Bachelor's degree in a technical field