1. Software Engineer - Site Reliability-SRE
Summary
As a Site Reliability Engineer, you will be responsible for overseeing and managing servers, infrastructures and installed systems that shoulder our business, ensuring its reliability, availability and performance
What you’ll be doing
- Develop and maintain the large-scale infrastructure that powers our services
- Build and maintain monitoring, alerting, and trending operational tools in cloud environments
- Investigate, diagnose, and resolve performance and reliability problems in a wide range of large-scale and high-throughput services
- Contribute to handbook, runbooks, and general documentation
Who you are
- At least 3 years experience working as a software engineer
- Strong coding skills, preferably in Python
- Experience operating a production environment at high scale with emphasis on availability, latency and healthy customer experience
- Experience with infrastructure as code and configuration management tools such as Chef, Salt or Ansible
- Knowledge of Linux systems internals
- knowledge of Computer networking
- Knowledge of container orchestration tools such as Docker, Kubernetes is a plus
- Experience with AWS or other cloud environments.
- Excellent technical writing and documentation skills.
- B.S. degree or equivalent experience.
2. Software Engineer Infrastructure
工作职责:
The role of the Infrastructure team is to keep one of the world's biggest mobile E-commerce platforms growing. Help scale a massive, highly-available platform end to end. You'll design distributed systems, validate performance, factor in security, and proactively monitor every corner of our stack. We practice continuous integration and deployment. We have fluid teams and each engineer has the opportunity to own and work on all areas of the technology stack. When things do go wrong, you'll be on-hand to fight the fires.
What you’ll be doing
- Design, implement, and maintain the core infrastructure for all applications at Wish.
- Provide on-going maintenance and support of internal tools, improve system health and reliability.
- Create tools for automating deployment, monitoring and operations of the overall platform.
任职资格:
- Skill with one or more programming languages. Python or Go preferred.
- Familiar with Cloud Computing like AWS, Container like Docker, automation tools like Chef a plus
- Have passions on designing systems with high scalability, high availability and high reliability.
- Have experience on system infrastructure design preferred
- B.S. degree or equivalent experience.
email: [email protected]