Responsibilities
- Improve quality of pager alerts while reducing noise
- Maintain awareness of engineering initiatives across the organization and monitor their impact on stability, cost, and performance
- Keep infrastructure up-to-date to take advantage of security patches and new features
- Improve operational security without sacrificing engineering independence
About You
- At least 5 years of experience as a Site Reliability Engineer, or related job
- Ability to read and understand product code (writing product code is a nice-to-have!)
- Familiarity with the state of the art in cloud technologies, including common providers, specific tools of the trade, and their strengths and weaknesses
- Experience operating applications and databases with demanding scalability or availability requirements
- Proven expertise in modern container orchestration practices (we use Kubernetes on GKE)
- A strong understanding of the performance, architecture, tooling, and cost of cloud systems
- A security focused mindset with a solid understanding of incident response and risk mitigation
- A strong collaborator who is transparent about progress on tasks, seeks feedback early and often, works effectively with the team and customers
Benefits & Perks
- Extensive health, dental, and vision benefits
- Open vacation policy - we all work hard and take time for ourselves when we need it, no strings attached
- Three months of fully-paid parental leave to any employee welcoming a child into their home
- 401k and commuter benefits
- Generous stock options - we all get to own a piece of what we’re building
- Regular team outings and activities
- Flexible working hours and location
- Monthly employee gifts
- For those in office, catered lunches throughout the week and a fully stocked kitchen with all your favorite snacks (healthy & non-healthy)
Example projects
- Overhaul a fleet of nginx load-balancers handling 100s of thousands of requests per second without incurring downtime
- Work with members of the engineering team to identify and resolve spikes in processing latency in our ingestion worker pool
- Automate database scaling to improve operating cost while maintaining the ability to respond to traffic spikes
- Help build tools to streamline the onboarding and release process for customers using LogRocket's On-Premise offering
- Improve the performance and reliability of the system our Product Engineering teams use to both test and deploy software
Top Skills
What We Do
LogRocket combines session replay, error tracking, and product analytics – empowering software teams to create ideal product experiences across web and mobile apps. Located in Downtown Crossing, we’re on a mission to build the best possible frontend monitoring solutions for engineering and product teams.
Why Work With Us
LogRocket is the only analytics & error tracking platform that doesn’t want its users spending all their time staring at dashboards and sifting through noisy alerts. We’re always looking for better ways to help customers find the most severe things holding back their apps so they can spend their time doing what they love – building great software!
Gallery










LogRocket Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
Hybrid first company, most roles can be remote, and individuals are given the flexibility and trust to decide if and when they are in the office.