职位为远程,需要英文好。月薪人民币 4-6 万。
请符合要求的朋友发送简历到
addingtime.com@gmail.com ,合适的话会沟通下一步。
DevOps Infrastructure Engineer
Responsibilities:
Design, implement, and maintain scalable and resilient infrastructure using Terraform, primarily on Google Cloud Platform (GCP).
Collaborate closely with development and operations teams to integrate infrastructure as code (laC) practices into the continuous integration and deployment (CI/CD) pipelines.
Develop and optimize CI/CD pipelines using GitHub Actions, ensuring efficient build, test, and deployment processes.
Oversee the implementation and management of GCP services, focusing on optimizing performance, scalability, and cost.
Implement and maintain monitoring and alerting systems for infrastructure and applications to ensure high availability and performance.
Ensure security best practices are integrated into infrastructure design, following industry standards and frameworks.
Conduct regular infrastructure reviews and audits to identify potential improvements or optimizations in the system architecture.
Collaborate with the security team to implement security controls and compliance measures, particularly in the areas of cloud security and network architecture.
Develop documentation and standard operating procedures for infrastructure setup, configuration, and maintenance.
Provide expertise and support in incident response and root cause analysis for infrastructure-related issues.
Stay updated with the latest trends and technologies in DevOps, cloud computing, and infrastructure automatior.
Educate and train team members on Terraform, GCP, and DevOps best practices to foster a culture of continuous learning and improvement.
Requirements:
Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
Strong expertise in infrastructure as code (laC) using Terraform
Proficiency in managing and deploying applications and services on Google Cloud Platform (GCP).
Experience with CI/CD tools, especially GitHub Actions, and understanding of software development lifecycles.
Knowledge of containerization technologies (e.g., Docker, Kubernetes) and their orchestration in cloud environments.
Familiarity with network architecture, security protocols, and cloud security best practices.
Proficient in scripting languages (e.g., Python,Bash) for automation and configuration.
Strong problem-solving skills, analytical mindset, and attention to detail.Excellent communication and teamwork abilities.
Relevant industry certifications in cloud computing (e.g.,Google Cloud Professional,Terraform Associate) or DevOps are highly desirable
Job Description
GENERIC SKILLS
strong communication skills: able to communicate clearly and in a concise manner
strong collaboration skills: demonstrated experience working in highly collaborative environment and using collaborative coding best practices
strong problem solving skills: able to deal with complex scenarios and be able to find solutions that are scalable,maintainable, secure by design
proficiency with at least one of the following programming languages: go, java, python, nodeJS - preferably go
KUBERNETES
demonstrated experience of building, maintaining or managing highly scalable kubernetes implementations working knowledge of kubernetes ecosystem and tooling
strong understanding of kubernetes internals - including service/networking, pods lifecycle, statefulsets and extensibility NICE TO HAVE experience working with contorllers/operators NICE TO HAVE working knowledge of ArgoCD and istio
CLOUD/NETWORKING/SECURITY
strong hands-on experience on one cloud provider, ideally GCP (but AWS/AZURE are acceptable) understanding of cloud best practices, security in the cloud.shared responsibility
at minimum basic understanding of cloud networking - ideally strong on networking, private service connect (or equivalent for AWS/AZURE) routing and network security controls (VPC Service Control,private/restricted API,firewall,WAF) at minimum basic understanding of TLS, mTLS, concepts pf encryption in transit and encryption at rest. understanding of load balancing (L4/L7),DNS,CDN
experience architecting for resiliency and high-availability
HASHICORP VAULT
hands-on experience configuring and managing HashiCorp Vault
understanding of vault engines and vault authentication/authorisation mechanismHA and DR strategies for vault
DATABASES
knowledae of different databases and understanding of key differences between consistency models
experience confiaurina.managing and scaling at least one know database technology (Postgres,MySQL,Mongo,CouchBase, Cassandra,CouchDB.CockroachDB.BigTable,etc...)
understanding of db infrastructure management, scaling databases, performance optimisation
knowledge of user management and access control experience/knowledge of replication, data recovery, disaster recovery
SDLC
Experience with setting up and configuring CI/CD pipelines using GitHub Actions/Workflows
Understanding of CI/CD best practices such as automated testing, code quality checks, and deployment automation
Knowledge of integrating security checks and testing into CI/CD workflows (e.g., static code analysis, dependency scanning, container security scanning)
Experience with writing custom GitHub Actions/Workflows using YAML or Javascript
OBSERVABILITY
understanding of observability pillars
working knowledge of at least one leading observability tool, preferably Grafana
understanding of observability best practices in the context of kubernetes and distributed systems
experience creating dashboards and implementing meaningful alerts NICE TO HAVE experience working with OpenTelemetry