Site Reliability Engineer (Buffer)
Location: Dallas, TXPosted On: 11/08/2023
Requirement Code: 66074
Requirement Detail
Job Description: Site Reliability Engineer (Buffer)
Bachelor's Degree in Computer Science or related; or equivalent combination of education and experience
5~~@~~ yrs overall experience in Software Application Development & Engineering
2~~@~~ years of SRE experience
1~~@~~ yrs experience in AWS services
Experience in Typescript, NodeJs, and web development technologies
Proficient in scripting languages such as Powershell and/or Python
Knowledge of DevOps methodologies and the tools involved such as CI/CD concepts, CI/CD tools (Jenkins, CodePipeline, etc.), automation and config Help build a Site Reliability Engineering culture by sharing best practices, approaches, documentation, and code with other engineering teams
Define and setup KPIs to monitor Error Budgets
Implement strategies to ensure Error Budgets stay above the defined-acceptance levels
Define and implement response mechanisms when Error Budget thresholds are breached
Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually;
Able to troubleshoot complicated issues handling OS, Networking, Database in a cloud-based SaaS environment and handle live production incidents, debug/troubleshoot infrastructure and application issues, including development and testing
Monitor application performance, take steps to improve overall application performance and stability and follow through with implementation (design, develop and test);
Conduct system analysis, configuration management and develops improvements for system software performance, availability and reliability;
Design, write, ship, and motivate the creation of software and systems to increase observability, product reliability and organizational efficiency;
Work closely with software engineers and QAs to ensure the system is responding properly to non-functional requirements such as performance, security, and availability;
Document your system knowledge as you acquire it over time, create runbooks, and ensure critical system information is readily available to those who need it;
Maintain and monitoring deployment, orchestration, of the servers, docker containers, databases, and general backend infrastructure;
Design, Develop & Test Terraform based Infrastructure as Code scripts to automate AWS infrastructure setup
Develop Typescript, NodeJS based REST/JSON Web Services deployed on AWS.
Compensation: 55-64.52 Hourly W2 (Open to C2C)