Skip to content

Instantly share code, notes, and snippets.

@kyleslattery
Last active December 17, 2015 23:09
Show Gist options
  • Save kyleslattery/5687532 to your computer and use it in GitHub Desktop.
Save kyleslattery/5687532 to your computer and use it in GitHub Desktop.

Dev/Ops at Artisan Mobile

Here's what you need:

  • 4+ years of managing cloud-based infrastructures
  • Experience maintaining 24-7 high availability uptime and support
  • Experience leading teams and having primary responsibility for operational performance and communications
  • Experience with automated environment scripting
  • Solid experience with performance monitoring and tests
  • Experience managing and optimizing big data deployments
  • Strong interpersonal and collaboration skills
  • A technical degree

Good indicators:

  • Significant comfort with the command line -- you've got a favorite set of flags for ps and netstat, and are pretty pumped about /sys replacing /proc.
  • Knowledge of how to manage large clusters; sshing into every machine one-by-one isn't scalable. Chef/Puppet are acceptable, but we're more excited about leaner solutions (e.g., Ansible, Salt).
  • Not scared of fixing kernel panics at 4am. Carrying a "pager" is an important part of the job. Knowing what to put into nagios and how to extract value from OpenTSDB / Cacti / Munin / Graphite definitely helps.
  • Engineering chops to not only benchmark database solutions, but throw out the slow and nonscalable ones for fast and fault-tolerant choices.
  • Experience with AWS / other elastic hosting solutions. Bare-metal may be in our future, but we're not there yet.

Here's what you'll do:

  • Day-to-day maintenance, monitoring, and troubleshooting of the cloud-based infrastructure powering our mobile development tool
  • Collaborate with the development and test teams to design, build and maintain dynamic web and mobile solutions built on a diverse stack - currently including Rails, Java, Jetty, Javascript, Python, Redis, and Mongo
  • Make recommendations to improve deployment, maintenance and up-time of infrastructure
  • Build and maintain a well documented, highly automated, no/low downtime deployment process to be exercised several times a month
  • Implement testing and monitoring tools/scripts
  • Ensure scalability of all resources in the infrastructure

Email: [email protected]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment