At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk. Location preference the Silicon Valley area We’re looking for an experienced Site Reliability Engineer to join ou... more details
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.
Location preference the Silicon Valley area
We’re looking for an experienced Site Reliability Engineer to join our team. At IBM, the Software Defined Networking (SDN) business which includes IBM Hybrid Cloud Mesh, NS1 and other offerings focuses on software based networking, an architecture approach that enables network to be intelligently and centrally controlled using software with main focus on automating network functions, allowing for simpler provisioning and management of network resources, everywhere from the data center to the campus to the edge.
The initial responsibility will be focused on improving our observability systems, but we are always looking for people who can help level-up our teams in multiple areas. We believe that a true SRE culture should allow people to focus on wherever the most improvements are needed, rather than being siloed or dedicated to one set of applications day in and day out.
Ideally, you’ll bring experience with:
Configuration management and infrastructure-as-code experience (Terraform and Ansible preferred)
Collaborating with product development engineers to identify, implement and report on service level indicators and objectives
Software development and scripting (GoLang preferred)
Deploying and troubleshooting complex, global production systems
Multiple hosting models preferred (managed, colo, and AWS/multi-cloud)
Admin-level Linux skills
3+ years of hands-on experience creating SaaS applications working in the production operation of a company whose primary products are SaaS applications.
Minimum of 5 to 7 years' experience in hands-on global production system deployment, administration and troubleshooting
Proven experience in systems performance analysis and debugging in a Linux environment
Experience in software development and scripting: bash and python are required (golang preferred)
Experience in automation is required
2+ year’s Experience with provisioning and configuration management systems (terraform, ansible) across multiple cloud providers
2+ years Experience in observability and alerting systems, splunk, ELK, open telemetry or similar systems
2+years experience in working with different cloud providers such as IBM Cloud, AWS, Azure, GCP
3+years Experience with operating systems running on Kubernetes / Openshift platforms.
Experience on Postgres DBA and kafka (or similar)
Collaborating with product development engineers to identify, implement and report on service level indicators and objectives
Willingness to participate in an on-call rotation.
Experience with the following would be an asset:
Working on integration and delivery systems such as Jenkins
Containerized applications
Experience with remote bare metal hardware provisioning. PXE boot, working with remote hands
Job Abstracts is an independent Job Search Engine. Job Abstracts is not an agent or representative and is not endorsed, sponsored or affiliated with any employer. Job Abstracts uses proprietary technology to keep the availability and accuracy of its job listings and their details. All trademarks, service marks, logos, domain names, and job descriptions are the property of their respective holder. Job Abstracts does not have its members apply for a job on the jobabstracts.com website. Additionally, Job Abstracts may provide a list of third-party job listings that may not be affiliated with any employer. Please make sure you understand and agree to the website's Terms & Conditions and Privacy Policies you are applying on as they may differ from ours and are not in our control.
Any time you conduct a search, the system shows you job matches, ranked by their Relevance Score (RS).
The score is calculated by a proprietary algorithm that uses Intelligent Machine Learning.
The Relevance Score tells you how well the job opportunity matches your search term or terms.
When not logged in, the system is limited to one search term. Scores for single term matches are usually lower.
When you register, log in, and set up multiple terms prioritized by importance, the jobs found for you will receive a much higher Relevance Score.