Associate Principal Engineer, Performance and Site Reliability
Nagarro · salary not specified · Remote, Sri Lanka · posted September 11, 2023
Job description
We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (15000+ experts across 26 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!
Must-have skills: Performance Tuning (general experience), Performance Testing (general experience), Cloud Architecture (strong), Observability
Job Description: The ISV practice is focused on building new offerings targeted towards the diverse market segments in the horizontal tech space, creating new competencies in niche technology areas, and supporting the sales team in making convincing pitches of Nagarro’s capabilities in the ISV market. As a rapidly growing company, we are seeking geeks who want to squeeze the last bit of performance out of an application and maintain every second of its uptime. We are looking for a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects, where performance, scalability, reliability, and resiliency are critical to supporting billions of online transactions every day.
Profile requirements:
Strong knowledge of Java/.NET, SQL/NoSQL, and event-driven architectures
Hands-on experience with Locust, JMeter, Gremlin and other performance and reliability tools
Performance engineering experience with a distinguished track record on technically demanding projects
Experience with performance testing, analysis, troubleshooting, and optimizing enterprise software
BS, MS, or PhD in Computer Science or equivalent work experience
Outstanding attention to detail and communication skills, ability to analyze / interpret technical data, ability to identify / resolve system bottlenecks, ability to identify potential limitations in software architectures, ability to write clear and concise technical documents, excellent troubleshooting and reasoning skills
Understanding of throughput, latency, memory and CPU utilization, concepts of multi-threading, cache, memory management, Java Virtual Machine, JIT
Working knowledge of HTTP load generation / measurement software (e.g. LoadRunner, JMeter, TestMaker, Silk Performer)
Working knowledge of APM and observability tools like Splunk, New Relic, Prometheus and Grafana
Working knowledge of AJAX, HTTP, REST, JSON
Working knowledge of database tuning and administration (e.g. Oracle, DB2, Oracle AWR / Statspack)
Working knowledge of operating system design / tuning / tools (e.g. Linux, Solaris, OProfile, VTune)
Roles and responsibilities:
The Performance and Reliability engineering architect must demonstrate the ability to understand novel system architectures, pinpoint problem areas, and develop architectures and succinct test plans across multiple hardware and software tiers.
Design modern, scalable cloud-native architectures capable of processing billions of online transactions every day
Orchestrate SLO/SLI design workshops and set performance goals with product owners and engineering leaders
Work closely with management, product owners, developers and quality engineers to understand product requirements and design appropriate performance and reliability test plans to verify feature performance and scalability requirements
Provide accurate scoping, clear strategies, and milestones for performance and reliability tests
Communicate status and risks to test completion dates to cross-functional teams, and offer mitigation strategies to ensure on-time delivery of all committed projects
Participate in discussions and planning with development teams
Design and implement scalability, stability, stress, reliability, availability, and longevity tests using industry-standard tools and custom-developed software written in Java or .NET
Review automation implementations and encourage sound design principles
Review test plans, results analysis, and capacity planning guides for in-flight releases
Work closely with the Production Success Performance Team to provide product and automation expertise for use in patches, research projects, sizing, and capacity planning
Coordinate technical presentations and training for capacity planning guides
Report weekly trends highlighting product performance and team status against committed milestones and projects