Search Jobvertise Jobs
Jobvertise

Senior Cloud Systems Engineer
Location:
US-CA-Sacramento
Email this job to a friend

Report this Job

Report this job





Incorrect company
Incorrect location
Job is expired
Job may be a scam
Other







Apply Online
or email this job to apply later

As an engineer who is a part of the Production Engineering Team, you will be integral to the design, set up, automation, and maintenance of challenges and solutions the team takes on. The ideal candidate should have effective intercommunication skills to promote collaboration with developers, support engineers, customers, and senior management. They will work closely with development squads, our client-facing teams, and customers, as well as other engineers and developers gathering requirements, architecting, and constantly delivering quality improvements to our platform.

As an Organization Senior Cloud Systems Engineer, you will...

  • Be part of PagerDuty rotation responding to platform incidents and provide support for other engineers who are responding to customer issues
  • Use your daily interactions with the platform and your experience and skills to constantly improve our environment and ensure that issues do not reoccur
  • Maintain and augment our monitoring systems so that they alert on symptoms, instead of issues
  • Be proactive and take ownership in identifying, raising, and resolving issues or deficiencies you see anywhere in our environment
  • Produce and improve internal documentation and SOPs where they are missing or lacking quality or details
  • Write new Terraform and Ansible code and improve existing codebase to help automate and

remove toil from the team

  • Live-debug applications and issues, and identify, resolve or own resolution for functionality and performance deficiencies
  • Identify, and suggest or resolve performance issues with production applications and their

configuration

  • Contribute to our scale goals by identifying areas for improvement that can lead to higher

efficiency

  • Automate yourself out of a job

You will be perfect for this role, if you...

  • Have a bachelor's degree in computer science or other highly technical, scientific discipline
  • Comfortably "own" Terraform
  • Comfortably "own" Ansible
  • Comfortably "own" the Linux shell
  • Have a proactive approach to spotting problems, areas for improvement, and performance
  • bottlenecks
  • Have coding/scripting experience beyond simple scripts
  • Have an eye for edge cases, behaviors, creative solutions
  • Are experienced with configuration management
  • Have an unstoppable urge to fix what is broken
  • Efficiently balance speed/iteration and quality

As a Senior Cloud Systems Engineer, we expect you to...

  • Fluently follow existing best practices for maintaining supported application and platform health and writing and testing code
  • Make impactful decisions about your technical contributions
  • Understand how our production systems work
  • Handle vague scope or identify improvements in small areas
  • Manage your work with little-to-no supervision
  • Actively collaborate with others through technical documentation
  • Able to troubleshoot and contribute to resolution of moderate to complex production problems, write post-mortems on them
  • Write SOPs for issues encountered and common tasks
  • Able to automate repetitive tasks using purpose-written code or commercially available tool
  • Detect inefficient common operational patterns and processes
  • Design and implement monitoring solutions for common or critical problems
  • As a technical resource and expert, you should be able to...
  • Handle medium complexity issues' troubleshooting and resolution; be a core resource in
  • troubleshooting and resolving those issues
  • Have sufficient understanding of the Organization pipeline to be able to assist in troubleshooting medium to complex platform issues
  • Write quality, clean, and maintainable code, following company best practices with minimal

guidance

  • Develop sufficient domain understanding to sanity check and ensure the quality of their output, as well as review that of other team members
  • Write custom code of medium to high complexity in at least 2 languages
  • Be the responsible/SME engineer for 2 or more internally maintained supporting infrastructure components and have general knowledge of all platform components
  • Proactively research and keep up to date on the patterns, advancements, and evolutions of tools and technologies used in the Organization pipeline
  • Identify problematic patterns in the ` applications, processes and tools and suggest and

implement resolution options

  • Make small design decisions independently, making appropriate tradeoffs between simplicity and performance
  • Follow existing patterns to create new instances of projects, features, or architecture
  • Create novel architectures of small components within your area of expertise This includes

diagramming the architecture and assessing trade-offs made and patterns applied, assessing the effort for the change and approximate timeline

  • Understand the flow control of nearly any system including those outside of your area of

expertise, though unable to necessarily suggest improvements to systems outside of your area

  • Properly sense when to engage Security for a review of a potential change
  • Understand techniques used to troubleshoot and fix production bugs and issues
  • Develop solutions/code that reduces future operational burden (e.g. by adding appropriate self-healing, high levels of alerting/monitoring/logging, reducing alert noise, etc.)
  • Ensure that infrastructure resources are not wasted by consistently following provided best

practices and rightsizing instances, proactively identify areas that can benefit from changes that lead to cost savings

  • Contribute to the build and release tooling and infrastructure
  • Contribute to defining SLAs and SLIs

You should also be able to...

  • Be successful when working on a large feature or improvement of vague scope
  • Identify and push forward new features or enhancements that improve the functioning of a system or feature
  • Identify problems and contribute well-scoped solutions to the team's roadmap.
  • Focus your work on what is most valuable for the team
  • Make and communicate accurate time estimates for own work, potentially spanning multiple sprints
  • Manage projects that span multiple groups of stakeholders
  • Act as an effective facilitator for team meetings
  • Consistently communicate technical decisions through high-quality design docs, tech talks, and wiki contributions
  • Create documentation, train and mentor others
  • Be the role model for less experienced team members

Brahma Consulting Group

Apply Online
or email this job to apply later



AWS Cloud Data Engineer for Remote (10+ Years)
  Click here
Princeton, NJ
Strong experience in relational and non-relational data architecture Strong experience in data classification based on data type Strong experience in ...
Posted more than a week ago



Aws Data Engineer
  Click here
Atlanta, GA
Our client is seeking to bring on an AWS Data Engineer with a strong preference for time series databases. USD Hourly Description: The Judge Group is ...
Posted more than a week ago



Sr Data Engineer W
  Click here
Round Rock, TX
Data Engineer designs, builds and oversees deployment and operation of AWS Cloud solutions to capture, manage, store and utilize structured and unstru...
Posted more than a week ago



Data Engineer Aws
  Click here
Arlington, TX
The Data Engineer will be responsible for architecting, designing, and implementing advanced analytics capabilities. The right candidate will have bro...
Posted more than a week ago



Sr Cloud DevOps Engineer
  Click here
Houston, TX
Sr. Cloud(Both AWS and Azure) DevOps Engineer 100% onsite in Houston, TX Phone and Skype Long Term Job Description: The Sr. Cloud DevOps Engineer wit...
Posted more than a week ago


 
Search millions of jobs

Jobseekers
Employers
Company

Jobs by Title | Resumes by Title | Top Job Searches
Privacy | Terms of Use


* Free services are subject to limitations