Job Description: We are hiring multiple Site Reliability Engineers (SREs) to join our growing team. The SREs will work closely with the DevOps team to implement standardized tools and practices to ensure high reliability and scalability of our systems.
Responsibilities:
- Maintain and enhance the reliability, availability, and performance of large-scale systems.
- Follow established DevOps guidelines and standards for tool development and system management.
- Develop automation scripts for monitoring, alerting, and incident response.
- Collaborate with the DevOps team to improve infrastructure and platform tools (e.g., spug.cc).
- Design and implement CI/CD pipelines using GitLab for application and infrastructure deployment.
- Manage containerized environments using Kubernetes.
- Monitor and analyze system metrics to optimize performance and efficiency.
- Implement disaster recovery and high-availability strategies to ensure system resilience.
Requirements:
- 3-8 years of experience in SRE or DevOps roles.
- Proficiency in Infrastructure as Code (IaC) using Terraform.
- Strong expertise in Kubernetes for container orchestration.
- Hands-on experience with CI/CD pipelines in GitLab.
- Proficiency in scripting languages like Python and Bash.
- Familiarity with cloud platforms such as AWS, including services like EC2, KMS, and VPC.
- Strong problem-solving and collaboration skills.
About OSL
OSL Group Limited is Asia’s leading fintech and digital asset company, publicly listed on the Main Board of the Hong Kong Stock Exchange under the ticker symbol 863.HK.
One of its subsidiaries, OSL Digital Securities Limited, is the world’s first Hong Kong Securities and Futures Commission licensed and insured digital asset trading platform.
Founded in 2018, OSL Group has an established history in the sector and is recognised by many as the leader in providing comprehensive regulated and licensed digital asset solutions.