{"id":54860,"date":"2025-09-24T16:16:42","date_gmt":"2025-09-24T09:16:42","guid":{"rendered":"https:\/\/bestarion.com\/us\/?p=54860"},"modified":"2025-09-24T16:41:52","modified_gmt":"2025-09-24T09:41:52","slug":"senior-devops-site-reliability-engineer-job-description","status":"publish","type":"post","link":"https:\/\/bestarion.com\/us\/senior-devops-site-reliability-engineer-job-description\/","title":{"rendered":"SENIOR DEVOPS\/ SITE RELIABILITY ENGINEER"},"content":{"rendered":"<p><strong>Bestarion<\/strong> is a subsidiary of Larion, a well-established software outsourcing company in Vietnam with decades of experience delivering high-quality technology solutions. Inheriting Larion\u2019s strong foundation and technical expertise, Bestarion continues to grow as a trusted partner for clients worldwide.<\/p>\n<p>For over 15 years, Bestarion has provided innovative outsourcing services and business solutions to successful clients in more than 15 countries. Our diverse range of services includes Big Data &amp; Data Analytics, Securities Trading Solutions, Surround Core Banking Solutions, E-commerce and Social Network App Development, and Web Application Development. We focus on today&#8217;s emerging trends such as Big Data, Cloud Computing, Social Networks, Mobility, and the Internet of Things.<\/p>\n<ul>\n<li><strong>Location:<\/strong> QTSC Building, 3rd Floor, 1 Quang Trung, Software City, HCMC<\/li>\n<li><strong>Working Location:<\/strong> Remote or working in Bestarion office\/ <span data-teams=\"true\">US onsite opportunity<\/span><\/li>\n<li><strong>Working Time:<\/strong>\n<ul style=\"list-style-type: circle;\">\n<li>Monday &#8211; Friday, 8:00 AM &#8211; 5:30 PM (Flexible depending on each project)<\/li>\n<li><strong>1-hour daily standup Tuesday-Friday<\/strong>, likely from 9 PM to 10 PM VNT.<\/li>\n<li><strong>Expectation to Travel to USA<\/strong>: The expectation is 1 &#8211; 4 trips\/year, with each trip lasting 1-2 weeks.<\/li>\n<li><strong>Maintenance Work Hours<\/strong>: The resource will need to work USA hours for three days every three months to perform maintenance on key production systems.<\/li>\n<\/ul>\n<\/li>\n<li><strong>About the project:<\/strong> We&#8217;re looking for a skilled and motivated DevOps\/Site Reliability Engineer (SRE) to join our growing team. In this exciting role, you will be responsible for building and maintaining our cloud infrastructure, automating our CI\/CD pipelines, and ensuring the reliability, performance, and scalability of our services. The ideal candidate will have a strong background in both software development and systems engineering, with a focus on GCP and automation tools, and a strong sense of ownership.<\/li>\n<\/ul>\n<h2>JOB DESCRIPTIONS<\/h2>\n<ul>\n<li>Design and manage infrastructure on Google Cloud Platform (GCP) using Terraform for Infrastructure as Code (IaC).<\/li>\n<li>Build, configure, and maintain CI\/CD pipelines using Jenkins and Groovy scripts to automate software delivery from code commit to production deployment.<\/li>\n<li>Manage Jenkins plugins, master\/agent nodes, and pipeline libraries to ensure the stability and scalability of our CI\/CD platform.<\/li>\n<li>Troubleshoot and debug automation code and interconnected systems to quickly identify and resolve issues, ensuring minimal disruption to services.<\/li>\n<li>Manage core GCP services including Compute Engine, Managed Instance Groups (MIG), Disk Snapshots, Storage, and Artifact Registry to support our application ecosystem.<\/li>\n<li>Containerize applications using Docker to ensure consistency across development, testing, and production environments.<\/li>\n<li>Implement and manage infrastructure as code, monitoring, and logging solutions to ensure high availability and performance of our systems.<\/li>\n<li>Collaborate with development teams to improve the entire software development lifecycle, from code to production.<\/li>\n<li>Develop and maintain workflows in Airflow to orchestrate complex data and application tasks.<\/li>\n<li>Troubleshoot and resolve production incidents, participate in on-call rotation, perform root cause analysis and perform key maintenance activities quarterly.<\/li>\n<li>Effectively communicate complex technical concepts to both technical and non-technical stakeholders through clear written and verbal communication.<\/li>\n<li>Strong expertise in managing and repaving Windows and Linux machines, ensuring security compliance through automated processes.<\/li>\n<li>Skilled in implementing security compliance measures, including repaving infrastructure, key rotation, and periodic updates to meet industry standards.<\/li>\n<li>Strong knowledge of monitoring and alerting systems, including Prometheus, Cloud Monitoring, and PagerDuty, to ensure system reliability and proactive incident response.<\/li>\n<\/ul>\n<h2>JOB QUALIFICATIONS<\/h2>\n<ul>\n<li>Bachelor&#8217;s degree in Computer Science, Information Technology, or a related field.<\/li>\n<li>Have over 5+ years of experience as a DevOps Engineer, SRE, or a similar role.<\/li>\n<li>Excellent verbal and written English communication skills are essential. You must be able to clearly document processes, write concise reports, and articulate technical issues to various audiences.<\/li>\n<li>Strong proficiency with Terraform for managing cloud resources.<\/li>\n<li>Hands-on experience with Jenkins, including managing Jenkins masters and agents, and writing Groovy scripts for pipeline automation.<\/li>\n<li>Proven ability to troubleshoot and resolve issues in complex, interconnected systems quickly and efficiently.<\/li>\n<li>Expertise in GCP services, including Compute Engine, MIG, Disk Snapshots, Storage, and Artifact Registry.<\/li>\n<li>Solid experience with Docker and containerization principles.<\/li>\n<li>Familiarity with Airflow for workflow management and orchestration.<\/li>\n<li>Strong understanding of Linux\/Unix systems, networking, and security principles.<\/li>\n<li>A proactive, &#8220;can-do&#8221; attitude with a strong sense of ownership and a desire to take on new challenges.<\/li>\n<li>Excellent problem-solving skills and a collaborative, team-oriented mindset.<\/li>\n<li>Maintenance Work Hours: The resource will need to work USA hours for three days every three months to perform maintenance on key production systems.<\/li>\n<\/ul>\n<h2>DEFINE YOURSELF AT BESTARION WITH ATTRACTIVE BENEFITS<\/h2>\n<ul>\n<li>Performance appraisal twice a year.<\/li>\n<li>Attractive benefits (13th salary, distinguished employee of the quarter and year,<br \/>\nseniority award\u2026)<\/li>\n<li>12 days off<\/li>\n<li>Lunch and parking allowance<\/li>\n<li>Healthcare and accident insurance<\/li>\n<li>Annual health check<\/li>\n<li>Working devices provided: Laptop and screen (If needed)<\/li>\n<li>Team Building activities in every summer, company trip, big annual year-end party every year, etc<br \/>\nFitness &amp; sports activities: football, tennis, table tennis, badminton\u2026<\/li>\n<li>Commitment to community development: charity every quarter, blood donation, public seminars, career orientation talks\u2026<\/li>\n<li>Support for personal loans such as home loans, vehicle loans, tuition fees<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Bestarion is a subsidiary of Larion, a well-established software outsourcing company in Vietnam with decades of experience delivering high-quality technology solutions. Inheriting Larion\u2019s strong foundation and technical expertise, Bestarion continues to grow as a trusted partner for clients worldwide. For over 15 years, Bestarion has provided innovative outsourcing services and business solutions to successful clients [&hellip;]<\/p>\n","protected":false},"author":26,"featured_media":10977,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-54860","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-jobs"],"_links":{"self":[{"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/posts\/54860","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/users\/26"}],"replies":[{"embeddable":true,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/comments?post=54860"}],"version-history":[{"count":4,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/posts\/54860\/revisions"}],"predecessor-version":[{"id":54870,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/posts\/54860\/revisions\/54870"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/media\/10977"}],"wp:attachment":[{"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/media?parent=54860"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/categories?post=54860"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/bestarion.com\/us\/wp-json\/wp\/v2\/tags?post=54860"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}