Current jobs related to Snr SRE Engineer - Bandar Baru Bangi, Selangor - RHB Banking Group
-
Senior Electrical Engineer
2 days ago
, Jln Ampang, Kampung Baru, Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Malaysia Ramboll Full time 80,000 - 120,000 per yearCompany Description About RambollFounded in Denmark, Ramboll is a foundation-owned people company. We have more than 18,000 experts working across our global operations in 35 countries. Our experts are leaders in their fields, developing and delivering innovative solutions in diverse markets including Buildings, Transport, Planning & Urban Design, Water,...
-
Senior Infrastructure Engineer
2 days ago
First Ave, Bandar Utama, Petaling Jaya, Selangor, Malaysia Grab Full time 80,000 - 120,000 per yearCompany Description About Grab and Our WorkplaceGrab is Southeast Asia's leading superapp. From getting your favourite meals delivered to helping you manage your finances and getting around town hassle-free, we've got your back with everything. In Grab, purpose gives us joy and habits build excellence, while harnessing the power of Technology and AI to...
Snr SRE Engineer
2 weeks ago
Primary Objective
Drive SRE practice and deliver the highest and industry leading level of system and infrastructure resiliency that meets business and regulatory requirements. This role also require Software Engineering knowledge to perform development work to automate manual processes or identify potential issues with applications.
Key Responsibilities
- Drive consistent SRE practice across all application, infrastructure and IT security teams
- Set up and operationalize SRE teams identified for specific application, infrastructure and IT security areas
- Provide coaching for SRE related functions to SRE engineers and other team (application and infrastructure support teams) practicing SRE within Group Technology to ensure consistent practice of SRE across teams.
- Contribute to the development and documentation of SRE best practices and procedures across the Group.
- Take ownership of Application Monitoring tools such as Dynatrace and work with vendors to design and drive consistent use of the monitoring tools across all teams
- Design, develop, and deploy automation scripts and tools to monitor, manage, and optimize systems.
- Analyze system metrics and logs from Dynatrace or other monitoring tools to identify potential problems and areas for improvement.
- Build internal expertise in Application Monitoring tools in order to continuously support and enhance observability across all relevant areas as technology and business environment changes
- Train and enable active use of Application Monitoring tools across all application and infrastructure support teams
- Provide support in deep analysis and trouble-shooting of technical issues encountered in the Critical and Required High applications and the underlying supporting infrastructure and IT security components. This applies during normal times and during incident / system downtime.
- Advocate and develop a strong culture of system resiliency and delivery of non-functional requirements
- Support, validate and sign off delivery of SRE-related non-functional requirements during project implementation
- Continue to fine-tune and enhance SRE practice as business and technology environment evolves.
- Keep abreast with issues and challenges encountered in system reliability and identify strategic / structural changes that need to be made to improve
- Build strong teamwork and collaboration between SRE, Application, IT Infrastructure and all other relevant stakeholders within Group Technology
- Promote continuous learning and culture of innovation within the team
- Build strategic and mutually-beneficial relationship with technology solution partners and service providers to further strengthen the Group's capabilities
Requirements
- Master's Degree - Master/ Degree in Computer Science, IT or a related discipline.
- 8 - 10 years in IT system development & implementation experience in Financial Services Industry (FSI)
- 3 - 5 years in system architecture and design related experience
- Programming Languages: Proficiency in key application programming languages such as Java, .NET C#, Python and scripting languages (e.g. Bash, Powershell) is required. Additionally, knowledge of other languages like Cobol can be helpful. Willing to learn new emerging programming languages as well is a plus.
- Knowledge on various databases such as MSSQL, Oracle, NoSQL etc is required.
- Knowledge of mainframe architecture, operations, and management is a plus. This includes understanding z/OS, CICS ,CICS Transaction Gateway, and other mainframe-specific technologies.
- Systems Reliability: Familiarity with principles of systems reliability, including monitoring, automation, and incident management.
- Networking: Understanding of networking concepts, protocols, and troubleshooting.
- Strong experience in designing and delivering non-functional requirements including High Availability (at hardware and software levels), Disaster Recovery, Archiving, Housekeeping, Backup and Recovery etc.
- Experience and strong appreciation in SRE practice including Service Level Objectives, Service Level Indicators, System Observability, Elimination of Toils, Automation etc.
- Excellent interpersonal and communication skills and highly influential in driving strong SRE culture
- Strong analytical and problem solving skills
- Strong R&D mindset