Show Date: Thursday, May 9, 2013
Join us as we discuss the Netflix OSS tools.
Watch Now
Panel
- Adrian Cockcroft github, twitter, blog
- Jeremy Edberg github, twitter
- Trotter Cashion github, twitter
- Bryan Berry github, twitter, irc: bryanwb, blog: devopsanywhere
- Brandon Burton github, twitter, irc: solarce
- Nathen Harvey github, twitter, irc: nathenharvey, blog
In the News
- Test Kitchen and Jenkins - A blog post from Joshua Timberman about how he’s set up a Jenkins build server to run test-kitchen on cookbooks.
- chefabulous - Bootstrap a Chef server on Amazon’s EC2 or using Vagrant with Fabric.
- The Application Cookbook Pattern, Berkshelf, and Team Chef Workflow - A blog post from Eric Reeves
Outline
- Introductions
- Chef News
- Business Week article
- Why is Netflix doing OSS
- Cloud-native
- No traditional HA tools from linux. Why not?
- S3 is shared filesystem for everything
- Have abstracted one layer above instances
- Oracle to SimpleDB transition
- Switching between NoSQL systems
- Configuration Management pushed up into the application itself
- Java as the language of choice
- There is a Python interface for some of the tools
- Closure, Groovy, etc. are other areas that are being explored
- AMI generation
- Aminator - Easily turn an app into an AMI
- Take base image, add some packages, run some chef recipes
- Looking at including chef in the base image for use during build time
- Code changes are always deployed as new AMIs
knife ec2 server create --bake
- https://github.com/opscode/knife-ec2/pull/110- Average lifetime of an instance is ~35 hours
- Aminator - Easily turn an app into an AMI
- Monitoring
- AppDynamics - Out-of-band monitoring
- Atlas
- Double exponential smoothing
- FFT - Fast Fourier transform - Look at traffic to be sure it’s going in the expected direction.
- Real-time FFT written in R is used for alerting. Other availability is determined after-the-fact.
- Circuit breakers
- Application Stack:
- Tomcat
- Cassandra
- Simian Army
- Chaos Monkey
- Chaos Gorilla - Will destroy an entire zone
- Latency Monkey - Reaches into Karyon and injects latency
- Is much better at finding issues / bugs than Chaos Monkey is
- Latency Monkey introduces latency, Hystrix should trip circuits
- Howler Monkey - Looks for overused resources and other auditing
- Security Monkey - Ensures certs are not expiring soon, etc.
- Janitor Monkey - Cleans-up unused resources
- Conformity Monkey
- How Trotter is using the Netflix stack
- archaius better than plan old properties files
- eureka, karyon, asgard
- Asgard - AWS console “on crack”. Built on Groovy.
- Necessary when you start deploying auto-scaling groups instead of auto-scaling images
- When would you not use auto-scaling groups?
- “Fork lift” operations - moving one app “to the cloud”
- Trotter recommends auto-scaling group even if the group size is one
- Time from deploy-ami to instance - about 3 minutes to start a fairly large instance (start 500 in about 8 minutes)
- How do I get started with the Netflix platform?
- Flux capicator - Flux Capacitor is a Java-based distributed application demonstrating the following Netflix Open Source components.
- Netflix Recipes RSS - RSS is a Netflix Recipes application demonstrating how all of the following Netflix Open Source components can be tied together.
- Netflix OSS Prize - A contest for Software Developers
- Visiting Netflix offices
Picks
Brandon
- Real Talk Podcast
- Mistborn Trilogy
- +1 for Trotter’s “Way of Kings” pick
Bryan
Adrian
- Adrian’s live demo benchmark on Cassandra on SSD
- Drift into Failure Sydney Dekker
- Netflix OSS Meetup
- Going Postal