Infrastructure Operations Lead - DevOps
Bellevue, Washington, United States
For over 25 years, Epic Games has been making award-winning games and game engine technology that empowers others to make visually stunning games and 3D content that brings environments to life like never before. Epic’s award-winning Unreal Engine technology not only gives game developers the ability to build high-fidelity, interactive experiences for PC, console, mobile, and VR, but is also a tool embraced by content creators across a variety of industries such as media and entertainment, automotive, and architectural design. As we continue to build our Engine technology and develop remarkable games, we strive to build teams of world-class talent.
We think of “Epic” as the collective effort of smart, talented, passionate people who are dedicated to building the highest quality experiences possible for our developer and player communities. If you’d like to be part of something Epic while creating amazing games or incredible technology used across a multitude of industries, we’d love to hear from you!
Epic Games is growing our Systems Engineering team to support operations of the large-scale, highly available, secure online services and infrastructure behind Epic’s games and products. The person in this role will lead a global team of systems engineers who build and operate our online platform. You will work closely with software engineering, customer service, quality assurance, community, and product teams to provide online services that enhance the user experience for all of Epic’s systems.
The person in this role will be responsible for the following:
Provide technical direction across infrastructure projects
Act as an escalation point for technical and non-technical issues that span the team
Prioritize work as needed across time zones
Establish efficient operational and escalation procedures that reduce toil
Define and maintain a clear roadmap for operational excellence of our online platform
Execute and maintain internal and external SLAs developed with business stakeholders
Develop, implement, and maintain policies, procedures, and associated training plans for network resource administration, appropriate use, and disaster recovery
Hire and maintain a healthy and sustainable team
Ensure a high bar of quality across code reviews and practices
Remove blockers for releasing games
Establish and maintain relationships with third parties and vendors
Lead large initiatives that have a major impact on our business, players, and games
The ideal candidate will have a mix of the qualifications below:
Deep understanding of Linux internals
Experience with cloud technologies such as AWS and Google Cloud
Ability to write code when needed and complete code reviews
Experience with running large-scale online services
Understanding of the pros and cons of different architecture patterns
Experience with database technologies such as Postgres and MongoDB
Please submit your resume and we'll be in touch soon.
This is going to be Epic!