Job offer
Sr. Principal Infra Info Srvcs (SRE)
The Sr. Principal Infra Info Srvcs (SRE) job at Northern Trust involves leading the design and architecture of complex systems to ensure reliability, scalability and performance, as well as working with teams to continuously improve platform reliability. The focus is on implementing observability solutions and automation to increase the efficiency of services.
Job description
Tasks
- System design and architecture: leading the design and architecture of systems to ensure reliability, scalability and performance of critical complex systems
- Operational excellence: development and maintenance of automation scripts and tools to optimize operations and reduce manual tasks; monitoring of system performance visibility
- Incident response/root cause analysis: cooperation in root cause analysis and implementation of measures to prevent recurrence of problems
- Monitoring and observability: Design and implement comprehensive monitoring and observability solutions to proactively identify and address issues before they impact the business
- Development and maintenance of dashboards and alerts to provide real-time insights into system health
- Reliability improvements: Identify opportunities to improve system reliability through process optimizations and technical solutions
- Documentation and communication: creation and maintenance of detailed documentation of systems, processes and procedures; effective communication with stakeholders at various levels within the organization
- Project Management/Collaboration: manage and prioritize multiple projects and initiatives related to reliability and performance improvements; collaborate with product, development and operations teams to align SRE efforts with overall business objectives
Requirements
- Bachelor's degree or equivalent experience
- 10+ years of experience in systems engineering with a focus on reliability, system operations and software engineering
- 5+ years of experience as a team leader or technical manager with hands-on experience driving projects to completion
- Strong knowledge of programming languages such as Python, Go, Ruby, Java, etc.
- Experience with on-prem and cloud solutions
- Experience with containerization
- Current experience in leading a cross-functional team
- Ability to implement automation for corrective actions based on deployed monitoring solutions
- Practical experience in an agile development environment
We offer
- A flexible and collaborative working culture
- An organization with financial strength and stability
- Opportunities for further development within the company
- A workplace with a higher purpose
- Reasonable accommodation for people with disabilities
Job details