TD Bank Site Reliability Engineer (US) in New York, New York
Site Reliability Engineer (US)
About TD Bank, America's Most Convenient Bank®
TD Bank, America's Most Convenient Bank, is one of the 10 largest banks in the U.S., providing more than 8 million customers with a full range of retail, small business and commercial banking products and services at approximately 1,300 convenient locations throughout the Northeast, Mid-Atlantic, Metro D.C., the Carolinas and Florida. In addition, TD Bank and its subsidiaries offer customized private banking and wealth management services through TD Wealth®, and vehicle financing and dealer commercial services through TD Auto Finance. TD Bank is headquartered in Cherry Hill, N.J. To learn more, visit www.tdbank.com. Find TD Bank on Facebook at www.facebook.com/TDBank and on Twitter at www.twitter.com/TDBank_US .
TD Bank, America's Most Convenient Bank, is a member of TD Bank Group and a subsidiary of The Toronto-Dominion Bank of Toronto, Canada, a top 10 financial services company in North America. The Toronto-Dominion Bank trades on the New York and Toronto stock exchanges under the ticker symbol "TD". To learn more, visit www.td.com .
Auto req ID:
Job Profile Summary
The Site Reliability Engineer provides technical leadership and integrated guidance across business, product and technology teams/partners to improve the design and operation of systems, making them secure, stable, scalable, fault tolerant, resilient, observable and efficient while ensuring performance and high availability.
The role sets the direction and influences the development and implementation of production systems and services to address emerging business needs and resiliency strategies while advancing the overall design architecture and technology capabilities in accordance with technology standards, and industry developments. SREs considers the performance, resiliency, fault tolerance and stability of production systems their primary focus, yet at the same time is committed to designing scalable and operational improvement through the application of software engineering practices.
- Must be eligible for employment under regulatory standards applicable to the position.
Provides technical leadership to improve the design and operation of systems in alignment reliability engineering best practices and overall Technology and Bank strategies, applying the practices of computer science and software engineering to the design and development of large, complex systems
Drives and influences integrated DevOps solutions across business, product, platform, infrastructure, development, support/DevOps teams that improve the design and operation of systems, making them scalable, reliable, and efficient while ensuring performance and high availability of products/services
Ensures availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of products/service(s) including enterprise systems that may serve multiple services and applications/segments
Influences and partners with key technology and product team members in the design and development of solutions that promote automation, innovation and the elimination of toil; identify optimal ways to improve the design and operation of systems to make them more scalable, more reliable, and more efficient and have the ability to implement the required changes
Defines and prioritizes problems to solve with applications/products/services and respective systems and drives the resolution/remediation across technology areas
Balances engineering and development priorities, providing expertise on automating the systems of their respective services and/or applications, and coding complex fixes and solutions in response to a major issue, toil or a new product/service feature
Has ownership of the strategic planning for capacity and its provisioning activities
Develops deep relationships with Product Owners, Ops, Tech Leads and Executives to build transparency and help foster end/end accountability of products and services
Works in close partnership with technology teams to support TD's business objectives and operational support goals providing domain expertise on strategic Infrastructure as well as Business project related activities (including both Change the Bank and Run the Bank programs)
Engages executive stakeholders appropriately to review progress and obtain input, validation and approval of key decisions
Anticipates client needs to identify appropriate solutions and to influence the development of innovative solutions
Ensures adherence of Operational (Production) Readiness practices of respective products and services
Sets service-level objectives (SLO) that defines availability of a particular product or service and exercise key decision rights of the SRE role (e.g. supporting release to production, rejecting software that is operationally substandard and directing developers to improve the code etc.)
Implements the observability requirements to monitor and assure that our systems measure to the expected service levels and perform with the appropriate operational characteristics
Focuses on reliability, scalability, and the development of the production computing infrastructure; including highly complex and scalable systems
Develops observability standards to ensure that production systems operate under known conditions and transparently provides these measurements to anticipate when errors or failures can arise.
Engineers solutions through problem post-mortem reviews to ensure that problem solutions are complete and that errors will not manifest again.
Anticipates internal and external business challenges, helping teams find solutions through continuously improving on process and technologies
Leads interaction with governance and control groups, (e.g. regulatory / operational risk, compliance and audit) to provide subject matter expertise and consult on risk issues related to Engineering technology and tools
Depth & Scope:
Expert Site Reliability Engineering role with comprehensive expertise in leading-edge theories, engineering practices, extensive coding and scripting
Advanced and highly specialized knowledge of TD applications, systems, networks, innovation models, design activities, best practices, business / organization, Bank standards, and may fulfill a governance role
Engineering specialist assigned to work autonomously on high profile, complex and/or high-risk technology initiatives with significant impact to the organization
Provides technical leadership / consulting / direction to multiple businesses and product teams, growing capability across the organization
Resolves unique and complex problems that have a broad impact on the business
Authoritative expert on site reliability issues within area of specialization
Understands the journey of an enterprise transformation where there is a hybrid cloud/non-cloud operating model.
Drives end/end accountability of products and services across the enterprise through collaboration and transparency
Primarily works at the product umbrella, segment, LOB level
Typically reports to the Site Reliability practice executive
At TD, we are committed to fostering an inclusive, accessible environment, where all employees and customers feel valued, respected and supported. We are dedicated to building a workforce that reflects the diversity of our customers and communities in which we live in and serve, and creating an environment where every employee has the opportunity to reach their potential.
If you are a candidate with a disability and need an accommodation to complete the application process, email the TD Bank US Workplace Accommodations Program at USWAPTDO@td.com . Include your full name, best way to reach you, and the accommodation needed to assist you with the application process.
EOE/Minorities/Females/Veterans/Individuals with Disabilities/Sexual Orientation/Gender Identity.
Education & Experience:
University degree in Computer Science or related technical field involving systems engineering or equivalent practical experience.
10+ years of engineering experience (e.g. Software or platform)
1 Vanderbilt Avenue Corporate
TD Bank AMCB
Job Category - Primary:
Site Reliability Engineering
For additional information regarding the compensation of this position, please click here
For an overview of TD's Benefits program, please visit TD's Total Rewards site (https://hrportal.ehr.com/tdtotalrewards)
Federal law prohibits job discrimination based on race, color, sex, sexual orientation, gender identity, national origin, religion, age, equal pay, disability and genetic information.