TD Bank Jobs

Job Information

TD Bank Site Reliability Engineer (US) in New York, New York


Job Title:

Site Reliability Engineer (US)

TD Description:

About TD Bank, America's Most Convenient Bank®

TD Bank, America's Most Convenient Bank, is one of the 10 largest banks in the U.S., providing more than 8 million customers with a full range of retail, small business and commercial banking products and services at approximately 1,300 convenient locations throughout the Northeast, Mid-Atlantic, Metro D.C., the Carolinas and Florida. In addition, TD Bank and its subsidiaries offer customized private banking and wealth management services through TD Wealth®, and vehicle financing and dealer commercial services through TD Auto Finance. TD Bank is headquartered in Cherry Hill, N.J. Find TD Bank on Facebook and on Twitter.

TD Bank, America's Most Convenient Bank, is a member of TD Bank Group and a subsidiary of The Toronto-Dominion Bank of Toronto, Canada, a top 10 financial services company in North America. The Toronto-Dominion Bank trades on the New York and Toronto stock exchanges under the ticker symbol "TD".

Department Overview:

Job Profile Summary

The Site Reliability Engineer provides technical leadership and integrated guidance across business, product and technology teams/partners to improve the design and operation of systems, making them secure, stable, scalable, fault tolerant, resilient, observable and efficient while ensuring performance and high availability.

The role sets the direction and influences the development and implementation of production systems and services to address emerging business needs and resiliency strategies while advancing the overall design architecture and technology capabilities in accordance with technology standards and industry developments. SREs consider the performance, resiliency, fault tolerance and stability of production systems their primary focus, yet at the same time are committed to designing scalable operational improvements through the application of software engineering practices.


United States

Job Requirements:

  • Must be eligible for employment under regulatory standards applicable to the position.

Customer Accountabilities:

Provides technical leadership to improve the design and operation of systems in alignment with reliability engineering best practices and overall Technology and Bank strategies, applying the practices of computer science and software engineering to the design and development of large, complex systems

Drives and influences integrated DevOps solutions across business, product, platform, infrastructure, development and support/DevOps teams that improve the design and operation of systems, making them scalable, reliable, and efficient while ensuring performance and high availability of products/services

Ensures availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of products/services, including enterprise systems that may serve multiple services and applications/segments

Influences and partners with key technology and product team members in the design and development of solutions that promote automation, innovation and the elimination of toil; identifies optimal ways to improve the design and operation of systems to make them more scalable, more reliable, and more efficient, and has the ability to implement the required changes

Defines and prioritizes problems to solve with applications/products/services and respective systems and drives the resolution/remediation across technology areas

Balances engineering and development priorities, providing expertise on automating the systems of their respective services and/or applications, and coding complex fixes and solutions in response to a major issue, toil or a new product/service feature

Has ownership of the strategic planning for capacity and its provisioning activities

Develops deep relationships with Product Owners, Ops, Tech Leads and Executives to build transparency and help foster end-to-end accountability of products and services

Works in close partnership with technology teams to support TD's business objectives and operational support goals, providing domain expertise on strategic Infrastructure as well as Business project-related activities (including both Change the Bank and Run the Bank programs)

Engages executive stakeholders appropriately to review progress and obtain input, validation and approval of key decisions

Anticipates client needs to identify appropriate solutions and to influence the development of innovative solutions

Shareholder Accountabilities:

Ensures adherence to Operational (Production) Readiness practices for respective products and services

Sets service-level objectives (SLOs) that define the availability of a particular product or service and exercises key decision rights of the SRE role (e.g. supporting release to production, rejecting software that is operationally substandard and directing developers to improve the code)

Implements the observability requirements to monitor and assure that our systems meet the expected service levels and perform with the appropriate operational characteristics

Focuses on reliability, scalability, and the development of the production computing infrastructure, including highly complex and scalable systems

Develops observability standards to ensure that production systems operate under known conditions and transparently provides these measurements to anticipate when errors or failures can arise.

Engineers solutions through problem post-mortem reviews to ensure that problem solutions are complete and that errors will not manifest again.

Anticipates internal and external business challenges, helping teams find solutions through continuous improvement of processes and technologies

Leads interaction with governance and control groups (e.g. regulatory / operational risk, compliance and audit) to provide subject matter expertise and consult on risk issues related to Engineering technology and tools



Job Description:

Depth & Scope:

Expert Site Reliability Engineering role with comprehensive expertise in leading-edge theories, engineering practices, extensive coding and scripting

Advanced and highly specialized knowledge of TD applications, systems, networks, innovation models, design activities, best practices, business / organization and Bank standards; may fulfill a governance role

Engineering specialist assigned to work autonomously on high profile, complex and/or high-risk technology initiatives with significant impact to the organization

Provides technical leadership / consulting / direction to multiple businesses and product teams, growing capability across the organization

Resolves unique and complex problems that have a broad impact on the business

Authoritative expert on site reliability issues within area of specialization

Understands the journey of an enterprise transformation where there is a hybrid cloud/non-cloud operating model.

Drives end-to-end accountability of products and services across the enterprise through collaboration and transparency

Primarily works at the product umbrella, segment or LOB level

Typically reports to the Site Reliability practice executive


At TD, we are committed to fostering an inclusive, accessible environment, where all employees and customers feel valued, respected and supported. We are dedicated to building a workforce that reflects the diversity of our customers and the communities in which we live and serve, and creating an environment where every employee has the opportunity to reach their potential.

If you are a candidate with a disability and need an accommodation to complete the application process, email the TD Bank US Workplace Accommodations Program. Include your full name, best way to reach you, and the accommodation needed to assist you with the application process.

EOE/Minorities/Females/Veterans/Individuals with Disabilities/Sexual Orientation/Gender Identity.


New York


Education & Experience:

University degree in Computer Science or related technical field involving systems engineering or equivalent practical experience.

10+ years of engineering experience (e.g. software or platform)

Work Location:

1 Vanderbilt Avenue Corporate

Job Category - Primary:

Technology Solutions

Job Category(s):

Technology Solutions

Province/State (Primary):

New York

City (Primary):

New York

Job Family:

Site Reliability Engineering

Time Type:

Full Time

Compensation Information:

For additional information regarding the compensation of this position, please click here

Benefits Information:

For an overview of TD's Benefits program, please visit TD's Total Rewards site.

Federal law prohibits job discrimination based on race, color, sex, sexual orientation, gender identity, national origin, religion, age, equal pay, disability and genetic information.