Remote sre Jobs in April 2020

16 Remote sre Jobs in April 2020

Post a job
    • 🔥 NEW: -50% on Remotive Community Memberships during COVID 19.
      Remotive Slack Community
  • Software Development (14) Software Development rss feed

    • 2 days ago
      Company Overview

      At Netlify, we're building a platform to empower web developers to build better, more elaborate web projects than ever before. We're aiming to change the landscape of modern web development. Netlify currently serves more than 700,000 developers worldwide.

      We’re a venture-backed company, and so far we've raised about $45 million from Andreessen Horowitz, Kleiner Perkins, Bloomberg, and prominent founders and professionals in our space.

      Netlify is a diverse group of incredible talent from all over the world. We’re ~40% woman or non-binary, and are composed of about half as many nationalities as we are team members.

      About the role:

      The role breaks down into three big parts:

      • Expansion: We are rapidly hiring and we need to expand the team. You will help us continue to build a diverse and inclusive team. This involves identifying skills the team needs, shepherding candidates through the hiring process, and building a more reliable, unbiased, and fair hiring process.
      • Delivery: Balancing technical debt and new features is always nuanced. You will partner with the product management team to manage this balance in accordance with the needs of the business. You will help define a project management process to help delivery and predictability with the goal of continuously shipping code to production.
      • Cultivation: You'll be instrumental in growing the careers of the individual engineers on your team. This means being their advocate and helping guide them in the direction they want. It also means being a culture driver on the team, fostering a positive, trusting, and supportive team.

      We have a small headquarters in San Francisco but we are a largely distributed engineering team. You will need to enable the team to work productively across different timezones. Fostering good habits of documentation, empathy, integrity to delivery on committed work are some of the key elements for success on our team. Experience working and managing remote teams is a big plus.

      Ideal Candidate:
      • Experienced manager of a technical backend team, especially around infrastructure development
      • Understanding of how engineering teams collaborate and track a project delivery
      • Familiarity, or willingness to learn, about the underlying system architecture of a PaaS (e.g. APIs, databases, distributed systems)
      • Commitment to designing a hiring process that is fair, efficient, and repeatable
      • Communicate priorities and expectations to both team and leadership
      • Passion for mentorship and advocacy for your team members in their career development
      • Ability to work across multiple timezones with remote colleagues
      • A desire to succeed personally through the health and success of the team
      About the team:

      The SRE team works as a support for the whole organization, from the different engineering groups to the sales and stability of the product. While we do oversee the incident management framework, we work to minimize the number of incidents via automation and tooling for the larger engineering team. The SRE team will focus on two big areas: stability and enablement.

      Stability because the SRE team is the frontline defense around issues and outages (e.g. cloud crashes, DDoS). The team will build the tooling needed to respond to these incidents and be prepared for the unexpected. This includes a 24/7 on-call rotation and working with other teams to prepare for incidents (e.g. more observability, alerting framework)

      Enablement because the team provides some of the core services that the rest of engineering relies on (e.g. kafka, kubernetes). They also empower developers to ship their code to production in a safe and repeatable way. This means working on different developer tools to provide more observability, better testing capabilities, and a smoother deployment process. Much of this is done by writing and using tools to further automation of rote or dangerous tasks.

      Right now the team is split between US and EU timezones to provide follow the sun coverage. Often coordination will have to take into account lag, timezones, and autonomy. The goal being to have SRE resources available whenever the company needs them.

      About Netlify

      Of everything we've ever built at Netlify, we are most proud of our team.

      We believe that empowered, engaged colleagues do their best work. We’ll be giving you the tools you need to succeed and looking to you for suggestions to improve not just in your daily job, but every aspect of building a company. Whether you work from our main office in San Francisco or you are a remote employee, we’ll be working together a lot—paring, collaborating, debating, and learning. We want you to succeed! About 60% of the company are remote across the globe, the rest are in our HQ in San Francisco.

      To learn a bit more about our team and who we are, make sure to visit our about page.

      Applying

      Not sure you meet 100% of our qualifications? Please apply anyway!

      With your application, please include: A thoughtful cover letter explaining why you would enjoy working in this role and why you’d like to work at Netlify. A resume or short listing of job history. (A link to a LinkedIn profile would be fine.)

      When we receive your complete application with the items above, we’ll get back to you about the next steps.

    • Impala (Western Europe)
      1 week ago

      Hello!

      Thanks for taking a look at the job description for a Chief Architect at Impala. We felt a little bit impersonal just throwing you right in there with words like “revolutionizing” and “disrupting”.

      As such, we want you to know that the person that wrote this job description apologises in advance for any clichés, startup tropes or sudden-insecurity-driven-panic-attacks that you might find in the description below.

      What is Impala now?

      Impala makes building travel software incredibly easy. We provide hotels with a powerful data management platform that makes connecting to software, hardware and room distributors extremely easy. Think Twilio for Travel.

      We launched in January 2019 and since then have grown at - quite frankly - a ridiculous pace. Our technology is already installed in hundreds of hotels worldwide, on six continents and categorises more than 1 billion data points.

      We are a growing team of exceptional people split across engineering, product and commercial that have built a fantastic, remote-but-far-from-distant community.

      Where is Impala going?

      We’re supported by some of the best investors in the business, the early stage backers behind Deliveroo, PillPack, SecretEscapes, Zoopla, Trivago and more. They’re on board because of our vision - revolutionizing a $1.6 trillion dollar market that’s been out-of-date for 20 years.

      Within the next two years, 25% of hotel bookings worldwide will flow through Impala technology and the product that we’re building will empower the entire ecosystem of travel technology (we already have over a thousand companies signed up).

      Anyone travelling anywhere will interact with products powered by Impala and to achieve this we’re growing the team to 50 people within the next year. So now is the most exciting time to be joining Impala as we go through a large push to scale up!

      A brief overview of what you’ll be up to in this role:

      As chief architect, you will act as the technical authority in a top-tier engineering team, ensuring that our technology can meet both immediate and future needs. You will report to (and work alongside) our VP of Engineering. 

      Technology sits at the heart of our business, which makes this a pivotal role with key responsibilities: 

      • You will lead on technology strategy & prioritization, managing short term technical risks and long term investment in our stack
      • You will be accountable for technical quality at Impala, including the reliability, security, and extensibility of our services and data 
      • You will ensure that monitoring, metrics and SRE practices are in place across all teams and services, to measure and meet high service SLA’s / expectations 
      • You will be the direct leader of an enablement group, which provides specialist  technical/DevOps/SecOps support to our development teams
      • You will provide technical direction and mentorship across teams to support the development of individuals 

      You should join us if: 

      • You have 10+ years of relevant and diverse product and engineering experience, with a period of executive-level responsibility at an organization with a strong engineering culture
      • You are an expert in the architectural design, development, and operation of large scale, high availability services 
      • You have demonstrated experience with modern web services, distributed service-oriented architectures and services that handle large quantities of data, across multiple different stacks. 
      • You aren’t afraid to get hands-on when necessary to demonstrate an approach / principle 
      • You are comfortable leading and mentoring teams and individuals

      Where in the world will you be working?

      This is a remote position. This means you can work from anywhere +/- 2 Hours of London, timezone wise (and less than a 3 hour flight).We are a remote-first company, our whole engineering team is fully distributed! This was a very deliberate choice to prioritise work-life balance and ensure we’re able to accommodate the best people in Europe. We have loads of remote initiatives to ensure you’re fully setup!:

      • We give you an additional generous Personalised Workspace Budget for things like monitors, keyboards, desk chairs and headphones.
      • We offer an office stipend (you can furnish a home office with a generous monthly stipend, or choose a co-working space near you).
      • You’ll travel once a quarter to meet and socialise with the rest of the engineering team somewhere in Europe.
      • You’ll have a generous Social Budget to travel to London based social events. You'll also have a special 'Loved Ones Budget' that we're incredibly proud to offer, we'll tell you more about that when we speak to you!
      • We have a dedicated Remote Experience Manager in the team, who, amongst other things, will be ensuring you're happy, integrated and empowered whilst working remotely.
      • We will give you a brand new high-spec laptop when you start, giving you the choice between a Thinkpad or Mac.

      Please note, we only accept candidates in Western Europe because of timezone and travel time differences. We make no exceptions (we don’t have time to!).

      What you’ll get if you join us: 

      • We’re in the top 14% of companies in the UK for holiday that we offer, at 36 days - so you’ll have plenty of time to rest and recharge
      • We also offer Impalans one Unsick day per year, so you can proactively look after your health (time to finally book that dentist appointment!)
      • Impala believes that great people should be able to focus on developing mastery of their skills, even if they don’t want to manage people. Here you’ll be able to access 3.5x the average annual raise, even staying in the same role.
      • We put our money where our mouth is when it comes to investing in your development - you’ll have access to a generous yearly learning budget to help you realise your potential. 
      • We offer all engineers a Weekly Recess Hour. This is an hour to take off and step out from work and invest in You Time! Lawn mow, yoga, walk or movie anyone?
      • We offer an extended lunch break option, meaning you can take an extra hour lunch break, and we trust you to make up for it later in the day.
      • We offer a stipend to set up your ideal place of work. You can choose between the option of ÂŁ150 per month to build out your home office, or we’ll pay for your membership at a coworking space near you up to ÂŁ350.
      • In addition, we give you a special Personalised Workspace Equipment budget for things like monitors, keyboards, headphones or desk chairs.
      • You’ll travel once a quarter to meet the rest of the team in an engineer colocation week, somewhere in Europe.
      • You’ll have a Social Budget to travel to/from & stay in London for a night, for our monthly Impala Social.
      • You’ll have a generous Friend & Family Budget, to get your close friends and family over once a year.
      • We’ll organise socials around your area with other remote workers. 
      • We have a Health & Wellbeing scheme in place, offering you the option of a monthly gym membership reimbursement each month, or a health food voucher each month; access to Headspace; Vitality health insurance (UK only); plus the chance to win a massage monthly. Plus loads more.
      • Perkbox. A platform designed to help you to live a better life by personalising, managing, delivering and measuring the best company perks in real-time. 
      • You’ll be working somewhere that encourages collaboration, is transparent, has a strong focus on wellbeing, and doesn’t just pay lip service to it’s Core Values.

      Sound interesting? 

      If you are excited to learn more then please check out our Medium page which includes more information about who we are, what we do, what matters to us, and our culture.

      Want to know more about our CEO, or want a deeper understanding of our what and why? Watch this video. https://www.youtube.com/watch?v=nxBZuoxCbKM

    • Wikimedia Foundation, Inc.
      2 weeks ago

      The Wikimedia Foundation is hiring two Site Reliability Engineers to support and maintain (1) the data and statistics infrastructure that powers a big part of decision making in the Foundation and in the Wiki community, and (2) the search infrastructure that underpins all search on Wikipedia and its sister projects. This includes everything from eliminating boring things from your daily workflow by automating them, to upgrading a multi-petabyte Hadoop or multi-terabyte Search cluster to the next upstream version without impacting uptime and users.

      We're looking for an experienced candidate who's excited about working with big data systems. Ideally you will already have some experience working with software like Hadoop, Kafka, ElasticSearch, Spark and other members of the distributed computing world. Since you'll be joining an existing team of SREs you'll have plenty of space and opportunities to get familiar with our tech (Analytics, Search, WDQS), so there's no need to immediately have the answer to every question.

      We are a full-time distributed team with no one working out of the actual Wikimedia office, so we are all together in the same remote boat. Part of the team is in Europe and part in the United States. We see each other in person two or three times a year, either during one of our off-sites (most recently in Europe), the Wikimedia All Hands (once a year), or Wikimania, the annual international conference for the Wiki community.

      Here are some examples of projects we've been tackling lately that you might be involved with:

      •  Integrating an open-source GPU software platform like AMD ROCm in Hadoop and in the Tensorflow-related ecosystem
      •  Improving the security of our data by adding Kerberos authentication to the analytics Hadoop cluster and its satellite systems
      •  Scaling the Wikidata query service, a semantic query endpoint for graph databases
      •  Building the Foundation's new event data platform infrastructure
      •  Implementing alarms that alert the team of possible data loss or data corruption
      •  Building a new and improved Jupyter notebooks ecosystem for the Foundation and the community to use
      •  Building and deploying services in Kubernetes with Helm
      •  Upgrading the cluster to Hadoop 3
      •  Replacing Oozie by Airflow as a workflow scheduler

      And these are our more formal requirements:

      •    Couple years experience in an SRE/Operations/DevOps role as part of a team
      •    Experience in supporting complex web applications running highly available and high traffic infrastructure based on Linux
      •    Comfortable with configuration management and orchestration tools (Puppet, Ansible, Chef, SaltStack, etc.), and modern observability infrastructure (monitoring, metrics and logging)
      •    An appetite for the automation and streamlining of tasks
      •    Willingness to work with JVM-based systems  
      •    Comfortable with shell and scripting languages used in an SRE/Operations engineering context (e.g. Python, Go, Bash, Ruby, etc.)
      •    Good understanding of Linux/Unix fundamentals and debugging skills
      •    Strong English language skills and ability to work independently, as an effective part of a globally distributed team
      •    B.S. or M.S. in Computer Science, related field or equivalent in related work experience. Do not feel you need a degree to apply; we value hands-on experience most of all.

      The Wikimedia Foundation is... 

      ...the nonprofit organization that hosts and operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge, free of interference. We host the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive. The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive financial support from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.

      The Wikimedia Foundation is an equal opportunity employer, and we encourage people with a diverse range of backgrounds to apply.

      U.S. Benefits & Perks*

      • Fully paid medical, dental and vision coverage for employees and their eligible families (yes, fully paid premiums!)
      • The Wellness Program provides reimbursement for mind, body and soul activities such as fitness memberships, baby sitting, continuing education and much more
      • The 401(k) retirement plan offers matched contributions at 4% of annual salary
      • Flexible and generous time off - vacation, sick and volunteer days, plus 19 paid holidays - including the last week of the year.
      • Family friendly! 100% paid new parent leave for seven weeks plus an additional five weeks for pregnancy, flexible options to phase back in after leave, fully equipped lactation room.
      • For those emergency moments - long and short term disability, life insurance (2x salary) and an employee assistance program
      • Pre-tax savings plans for health care, child care, elder care, public transportation and parking expenses
      • Telecommuting and flexible work schedules available
      • Appropriate fuel for thinking and coding (aka, a pantry full of treats) and monthly massages to help staff relax
      • Great colleagues - diverse staff and contractors speaking dozens of languages from around the world, fantastic intellectual discourse, mission-driven and intensely passionate people

      *Eligible international workers' benefits are specific to their location and dependent on their employer of record

    • Astronomer helps organizations adopt Apache Airflow, an open-source data workflow orchestration platform. We run a managed SaaS offering (Astronomer Cloud), as well as a product that our customers install into their own Kubernetes cluster (Astronomer Enterprise).

      We're looking for infrastructure-oriented people to join our Cloud Operations Team, which is responsible for building and scaling our SaaS offering.

      Responsibilities:

      • Work with our team of SREs and Developers to operate our secure, highly automated runtime environment to dynamically scale for our customer needs

      • Be the person to track uptime and cost metrics on a daily basis and plot against SLAs and budget

      • Add metrics, alerts, and auto-remediation and auto-scaling / cleanup capabilities as needed for uptime and cost management

      • Add and track security telemetry data, including management of employee access for administration and customer support

      • Deploy code to production while following release management processes including canary deployments

      • Participate in on-call rotation to meet our SLAs

      • Follow procedures for escalations to engineering and communication of resulting status and ongoing communication with the customer

      Requirements

      • Kubernetes Experience (Docker, Kubernetes, Helm)

      • Cloud Automation Experience (Terraform, other tools)

      • Cloud Networking (AWS/GCP/Azure)

      • Comfortable communicating with customers

      Bonus Points if you're familiar with:

      • Apache Airflow

      • ElasticSearch/Kibana

      • Prometheus/AlertManager/Grafana

      • Redhat Openshift

      At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

    • Aptible (North America)
      3 weeks ago
      About Aptible

      Our Vision

      We see a future where it’s easy to bring a great idea into the world using the internet, while respecting data security and privacy. The next generation of businesses will design security and privacy into their operating processes. If every business is going to be a software business, every business will need to be a security business.

      We’re working to make information security a core competency of every startup. We envision a world in which startups have access to great information security, are empowered to focus on their businesses instead of on compliance, can scale faster and more efficiently, and are confident that they're creating quality products.

      Our Team
      We wrote the Aptible Owner's Manual to help members of the company get a clear sense of what this team is — what we mean by “us.” We've now made this open to the world and invite you to read it, as a prospective member of the Aptible Team.

      Our Commitment to Diversity and Inclusion
      We prioritize diversity within our team and value different perspectives, educational backgrounds, and life experiences. We encourage people from underrepresented backgrounds to apply.

      About this Role

      We're looking for a Site Reliability Engineer to improve the infrastructure, reliability and security of our PaaS product, Aptible Deploy.

      Our next SRE will be an early member of the Aptible team. Reporting to our Customer Reliability Engineering Manager, you will be responsible for reducing the overall amount of Site Reliability work and determining an SRE roadmap.

      Our Commitment to Diversity and Inclusion
      We prioritize diversity within our team and value different perspectives, educational backgrounds, and life experiences. We encourage people from underrepresented backgrounds to apply.

      Your Impact
      • You will own and manage both internal and external tooling like PagerDuty
      • You will develop tools and processes to make monitoring, detection and issue resolution easier
      • You will prioritize and perform proactive maintenance and improvements of the entire system
      • You will help assess and remediate vulnerabilities and risks as a member of the security team
      • You will be a key member of our 24/7 oncall rotation
      You Competencies
      • You have some familiarity with one or more of the technologies that we use including: Ruby, Docker, Postgres, MySQL or Redis
      • You have experience running production environments on AWS
      • You have 3-5 years as software engineer or SRE or equivalent experience
      Our Interview Process

      We seek to make the experience of interviewing with us as delightful, efficient, fair, respectful, and transparent as possible.

      A typical process at Aptible might include the following steps, and takes approximately 3 Weeks to complete. We try to move as quickly as possible, but if you have any time constraints, please let us know and we'll do our best to accommodate.
      1) An Introduction to Aptible with the Hiring Manager (30 Minutes via Zoom)
      2) A Discussion-Based Interview with an Aptible Team Member (45-60 Minutes via Zoom)
      3) A Take-Home Work Sample Exercise (NB: You will be compensated for completing this.)
      4) A Discussion-Based Interview with an Aptible Team Member (45-60 Minutes via Zoom)

      We believe that the Work Sample Exercise is an important part of the process, in that it gives you the opportunity to demonstrate your skills in a concrete way. We take the time to design these exercises such that they: a) give you a view into the actual work you'd do at Aptible, and b) are standardized, so every candidate is evaluated using the same criteria.

      Lastly, Aptible conducts calls with 3-4 References, ideally managers who have directly supervised you in the past and/or colleagues who can speak to your work.

      If you have a disability or special need that requires accommodation, please let us know by completing this form, and we will reach out soon to see how we may be able to assist.
    • Thought Industries (US - East Coast)
      1 month ago

      As our US east-coast based Site Reliability Engineer with solid coding skills you will be working with our Development team to ensure the availability, reliability, scalability, and performance of our platform’s automated cloud infrastructure. You will be part of a larger, distributed team that is focused on improving the business of learning in the cloud environment.

      As part of our SRE team, you will:

      • Work with SRE team and other developers to code, build, maintain, and monitor core pieces of infrastructure.

      • Take part in migrating data and other platform-related tasks (via automation when possible).

      • Work with our wider product team to meet new platform needs.

      • Take part in on-call rotation, responding to alerts and handling platform outages (particularly during EST hours).

      As an SRE Engineer, you:

      • Understand the requirements and challenges of hosting applications in the cloud

      • Understand the flow of a web request through a cloud application stack

      • Are mindful of risk-management and testing new production changes thoroughly

      • Feel the need to automate your problems away

      As an Engineer, you:

      • Communicate and collaborate well in a distributed team

      • Take a pragmatic and thoughtful approach to solving problems

      • Are a self-starter who can take a challenging task and run with it

      • Care about the quality of your work

      • Have empathy for your users and team

      • Enjoy learning new skills and building solutions to difficult problems

      Our Ideal Candidate:

      • 2+ years of engineering experience

      • Experienced in building, managing, monitoring, testing and optimizing a production cloud application.

      • Confident in their overall coding & application development skills

      • Fluent with one scripting language (ideally python, bash)

      • Has working experience with Node.js

      • Experienced with container-based deployment (e.g. K8s)

      • Experienced with AWS and its various offerings

      • Experienced with at least one flavor of linux and its setup and maintenance

      • Experienced with maintaining a production application across multiple regions

      The company

      Thought Industries is a startup in the Online Learning space. We enable training and software companies to launch and monetize external learning programs — think Shopify meets Udemy/Coursera.

      We are a growing, well-funded technology company, with a talented team and a clear vision. This is a unique opportunity to take a lead role at an exciting SaaS software company with a robust cloud-based platform. We hire talented people who are self-motivated and team orientated. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or veteran status.

      To apply: Please submit your cover letter explaining what kind of role you are looking for and why Thought Industries specifically interests you along with your resume.

    • Bold Penguin (Eastern Time +/- 2 hours)
      1 month ago

      We didn’t create Bold Penguin because commercial insurance is broken. It isn’t. But as the world has gotten more connected and digitized, commercial insurance lags behind—creating a fragmented landscape where businesses, agents, and insurance companies struggle to interact in a smooth and easy way. That’s why we’ve built a highly efficient exchange that cuts the friction out of commercial insurance by connecting everyone to the right quote in record time.

      Powering the world of insurance is no small feat, so we’ve brought on a team that's not only incredibly talented but also passionate about our potential to upgrade the entire industry. As more and more companies big and small depend on our technology to operate in the commercial insurance space, we’ll need the best talent all around to support our growth. That’s why we’re looking at you (yes, you!) to make a bold move and join our adventure.

      Your  Role

      As a Cloud & Site Reliability Engineer, you will be a subject matter expert in building highly reliable, highly scalable features and infrastructure. You’ll use DevOps principles to ensure that Bold Penguin’s software systems are always available and ready to scale to meet growing demands. 

      Click here to learn more about DevOps on the glacier

      What You’ll Do

      • Ensure the reliability, performance, and availability of our platform by working as part of a cross-functional product team
      • Participate in agile ceremonies such as iteration planning, retrospective, and daily standups
      • Be part of the shared on-call rotation and proactively research possible issues affected the availability of our platform
      • Understand and clearly articulate tradeoffs in architecture decisions with regards to cost, security, operational efficiencies, performance, and availability
      • Build and maintain infrastructure with executable code (IaC) and automated delivery pipelines
      • Be passionate about Cloud/DevOps/SRE concepts such as Immutable Infrastructure, Cattle vs Pets, Infrastructure as Code, Delivery Pipelines

      Skills & Qualifications

      • Deep, hands-on expertise with AWS Cloudformation and other Infrastructure as Code tools
      • Experience with Amazon Web Services; specifically EC2, ECS, ELB, CodePipeline, RDS, Redshift, S3, IAM, and Lambda
      • Ability to articulate Cloud & DevOps concepts to a variety of technical & non-technical team members
      • Bonus points for expertise in implementing security & compliance frameworks such as SOC/2, NIST 800-53, and NIST 800-171 especially in Amazon Web Services
      • Bonus points for AWS Certifications 
      • Bonus points for familiarity with microservices architectures, Ruby on Rails and/or ETL tools such as Fivetran.
      • Experience working at technology companies and startups desirable
      • 2-4 years + of working remote, full time, and/or with full time co-located teams across different time zones.

      BONUS POINTS

      • Full-stack expertise in multiple tiers of modern web applications (e.g. front end, back end, infrastructure, etc.)
      • Open-source contributions and/or speaking experience.
      • Previous work experience in insurance and/or experience with policy rating very desirable.
      • You love Penguins! ;P

      TRAVEL TO THE "GLACIER" (please read)

      • We are firm proponents of "seeing eye to eye by meeting face to face". As such, our remote team travels in once a quarter for a full day of collaboration, goal setting, team building, etc.  Are you able to make this work?  In addition to this we also ask that, if hired, you are able to make the first week onsite for onboarding/training. 

      PENGUIN PERKS

      • For a healthy colony.
        • Our plan covers 50% of your Medical Premiums – Health - HRA, Dental, Vision, and Life Insurance, as well as Short & Long Term Disability (Trust us, the benefits are great!
      • Penguins plan for the future.
        • 401k Match program, up to 4%! 
      • Parental Leave
        • 16 weeks of parental leave (your kids need you there!)
      • Need a vacation?
        • Unlimited PTO - Please take a vacation - you need it and we applaud it and in fact we require you take 10 days off!
      • Hungry? Thirsty?
        • We offer free snacks and drinks, as well as catered lunch every Monday (even to our remote employees...nomb nomb nomb)
      • Penguins need to learn!
        • We support your professional growth. Certifications, training, memberships, and conferences are actively encouraged—and often covered.
      • Penguins are social creatures and love to play!
        • We have frequent happy hours, company events, and outings. What kind of company would we be if we didn't have some fun!?!? 
      • Penguins give back.
        • We offer volunteer opportunities every month!  There is no better feeling than giving back =)
      • Don’t want to move to Columbus?
        • We offer up to 100% remote engineers!
        • You must be OK visiting the office for a day or two every quarter - we are all about that camaraderie! 

      Penguins believe in inclusion. That’s why we’re proud to be an equal opportunity employer that considers all qualified applicants regardless of race, color, religion, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, beak size, or inability to fly.

    • InVision is the digital product design platform used to make the world’s best customer experiences. We provide design tools and educational resources for teams to navigate every stage of the product design process, from ideation to development. Today, more than 5 million people use InVision to create a repeatable and streamlined design workflow; rapidly design and prototype products before writing code, and collaborate across their entire organization. That includes 100% of the Fortune 100, and organizations like Airbnb, Amazon, HBO, Netflix, Slack, Starbucks and Uber, who are now able to design better products, faster.  

      Our team is in search of a Lead Software Engineer - SRE to help us change the way digital products are designed.

      This role will help ensure uninterrupted service for InVision customers and act as a force multiplier for product teams to deliver better software faster. This role will have ownership of foundational reliability services and a big impact on our product.

      About the team:

      The reliability team is dedicated ensuring resiliency at scale. You will lead design, development and delivery of solutions which to enhance the scalability, availability, and efficiency of microservices. This role is will have direct impact on platform and product teams by identifying problems, anti-patterns, and opportunities to add resilience to applications. Our tech stack includes but is not limited to Kubernetes, AWS, Kafka, Kinesis, Go and Java based microservices.

      What you’ll do:

      • Provide leadership and guidance in addition to participating in hiring efforts
      • Uncover and advocate reliability, performance and upstream solutions with internal stakeholders
      • Create tools for monitoring, self-healing infrastructures
      • Code in Golang!
      • Develop solutions for circuit breaking, chaos testing, load shedding, rate limiting, server side and event bus resiliency
      • Identify performance bottlenecks and troubleshoot performance issues
      • Collaborate to problem solving and design
      • Engage in service capacity planning and demand forecasting, software performance analysis and system tuning
      • Mentor other developers and site reliability engineers in new technologies being implemented

      What you’ll bring (we encourage you to apply even if you don’t meet every single one):

      • Demonstrated Leadership experience
      • Experience finding anti-patterns and engineering reliability at scale
      • 1+ years of experience with Golang
      • Good communication skills and experience leading projects
      • A degree in computer science, software engineering, or a related field, or equivalent experience
      • Systematic problem solving approach, coupled with a strong sense of ownership and drive
      • A passion for creating performant and reliable applications

      About InVision:

      InVision offers an incredibly unique work environment. The company employs a diverse team all over the world. Each InVision team member is given the freedom and tools to do their best work from wherever they choose.

      The benefits we offer in the United States and Canada include competitive health plans and retirement plans. Some InVision-wide benefits offered to all employees across the globe include a flexible vacation policy, monthly coffee shop stipends, annual allowances for books related to your profession, and home office setup & wellness reimbursements. InVision is an international employer so some benefit offerings will vary from country to country.

      InVision is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. If you have a disability or special need that requires accommodation, please let us know.

    • As a Senior Security Engineer on the SRE Team at Skillshare, you’ll play a key role in helping us continuously improve our security programs to ensure the best experience for our users through the safety of our technology and data. 

      This role spans strategic work of putting in place forward-looking initiatives as well as responding to external threats on an ongoing basis, which means the opportunity for impact across the board.  We’re scaling quickly and are excited to bring someone onboard who can help us proactively tackle challenges – both in the day-to-day operations and anticipated future ones. 

      You’ll collaborate with the other members of the SRE team as well as the product development teams to plan and implement various security initiatives. We’ll look to your strategic expertise, reliable execution, and sound judgment to improve and maintain our security infrastructure, along with creating and improving processes for maintaining a secure product and environment.

      You’ll be joining a team that’s passionate about technology, and helping pave the way for building products together that we’re proud of. We’re excited to meet you.

      What you'll do:

        • Improve, monitor and maintain our information security.

        • Execute security initiatives related to infrastructure, product, and data.

        • Make strategic recommendations and improvements to our security.

        • Work with application developers to improve the security of various product features.

        • Proactively prep and train developers and raise the security awareness of everyone in the organization.

        • Quickly and proactively respond to incoming security threats.

        • Continually assess, address and report on the levels of threat and preparedness.

      Why we're excited about you:

        • 7+ years of experience building, supporting and securing cloud-based web infrastructure with AWS.

        • Knowledge of best security practices for building web applications.

        • Experience with security monitoring tools.

        • Experience in compliance with industry standards such as PCI, OWASP, NIST, GDPR etc.

        • Experience with Single Sign-on (SSO) for internal systems.

        • Understanding of and ability to deal with and prevent typical security threats and risks.

        • Deep understanding of web application infrastructure.

        • Working knowledge of software engineering.

        • Strong communication skills – you’re a natural collaborator and can report out to stakeholders of all levels.

        • Tech stack knowledge: Docker (Kubernetes experience is a plus), Linux, DataDog, AWS security products, MySQL.

      Why you're excited about us:

        • Impact: you’ll play a key role in shaping the direction of a comprehensive security approach long-term.

        • Growth: Our team is small, so you’ll have room to wear a lot of hats and take on more responsibility over time. 

        • Our mission: We are building a learning ecosystem for the new economy and changing millions of lives for the better.

        • Our team: We have a passionate, smart team that is a lot of fun to work with.

        • Your life: We take pride in our flexibility. Working remotely is part of how we need to work these days. You’re a professional, and we trust you to do what you need to do.
    • Description
      As a well rounded systems reliability engineer with a diverse set of skills, this makes you one of the very best people to troubleshoot, monitor the platform, and be on top of releases. You should definitely be the type that appreciates diversity in your day, and challenges outside of your comfort level! A typical day might include these types of activities:

      - Taking charge of the build process and pipelines across the platform.
      - Being keenly aware of systems architecture and automatically adding in redundancy and backup for new systems and software.
      - Assist in troubleshooting a complex customer issues across network devices, server hardware, virtual machines, in-house software and open source software. Not only can you run tcpdump with filters on the command line, but you can read it there also.
      - Adding additional monitoring and alerting on all systems across the platform that will help you identify one of those annoying intermittent issues you have seen in the logs.


      Skills & Requirements
      The right candidates will probably have a CS degree, solid scripting and automation skills, great troubleshooting skills across the OS and network, a good grasp on security concepts, experience with routing platforms and protocols, and enjoy working collaboratively.

      Specific requirements include:

      - Experience in automating tasks through scripting. You should be very well versed with Python, and probably a few other languages. We will ask for script samples.
      - High degree of drive to improve and automate your environment with minimal guidance
      - Be able to solve for immediate, and plan to accommodate for future problems
      - Experience with Ansible, Salt, Chef, Puppet, Terraform, or CFEngine. Experience with Ansible and Terraform preferred.
      - Experience with build pipelines, integration testing and Jenkins.
      - Experience administering a wide variety of *nix platforms, including multiple Linux variants.
      - Solid understanding of Layer 2 and Layer 3 protocols including IPv4/6, 802.1Q, BGP, MPLS, etc., and understanding a multitude of different network architectures.
      - Experience with Google Compute, AWS, or other cloud based compute and database services.
      - Understand the importance and implementation of backup and redundancy across many layers of databases, systems, and network configurations.

      Some knowledge that would be a huge plus:

      - Familiarity administering/troubleshooting Juniper/Cisco/Arista platforms.
      - Experience with extremely large scale network management and monitoring.
      - Experience with Postgresql, TimescaleDB, ElasticSearch

    • 1 month ago
      Do you want to be part of a team that helps over one million designers create amazing products every day? We're looking for a full-time Site Reliability Engineer to join us at Sketch.

      We are building a cloud platform that helps teams to collaborate on Sketch designs in every possible, efficient, and beautiful way.
      Your mission will be to shape this cloud infrastructure defining and building every piece, from development environments to metrics processing and observability, including security policies, network design, deployment strategies, high availability, etc...

      Our stack is currently based on a mix of serverless and traditional server applications. You will propose new projects to make sure this platform has the best technology for our product goals and our team. You are proactive and have a "get the job done" attitude. You are also not afraid of getting deeper and deeper in order to debug a problem, especially in production.

      There are always many things to do at Sketch. You need to be an organized and communicative person. You are used to prioritizing Infrastructure tasks and projects and you like to back your decisions and proposals with arguments. As a part of a team with very skilled people being an excellent team player is essential.

      As a remote organization
      There are three keys to us. It requires excellent communication skills as well as good written and spoken English. You need to be self-motivated and be comfortable working in a remote position. And also it requires high-quality documentation. You to have an eye for detail, in general, and especially for the documentation.

      We believe in
      Automated, simple, and quality tested infrastructures. It's essential that you have experience developing infrastructures as code and you enjoy coding. You are very critic with your own job and you always try to find the cleanest way to do it. You understand well the right balance between adopting new technology, current stability, maintainability, and simplicity. Like us, you also believe that speed and reliability are two of the most important web platforms features. You like to design and build processes and platforms that run flawlessly and fast.

      The ideal candidate
      • Has experience with different stacks (mainly Linux based), technologies and production models and has participated actively on the build of important pieces of a cloud platform.
      • We would like to know as much about you as possible. Contact us and tell us about your experience and your motivations for this job and send us any link of something that represents you or your experience.
      Even if you feel you are not 100% exactly the person described, we would still love to hear from you. We value anything that makes you different from the description.
      Even if you're not able to tick all of these boxes, we would still love to hear from you.
    • 1 month ago
      To join our growing team, SugarCRM is currently seeking an experienced Site Reliability Engineer.  This role can be based in one of our U.S.-based offices or remote.

      Impact you will make in the role:
      • Manage applications in a CentOS Linux-based environment
      • Build repeatable infrastructures with Ansible
      • Develop and execute plans for rolling out new technologies rapidly
      • Improve monitoring infrastructure, build out data aggregation and alerting rules
      • Work closely with engineering to build scalable solutions
      • Triage tickets raised by our support organization and implement fixes
      • Support our private and public cloud environments and customers
      • Mentor other members of the Operations team
      • Participate in an on-call rotation

      Expertise you will bring in:
      • BA/BS in Computer Science with Network Engineering or Information Systems emphasis, or equivalent work experience
      • Extensive knowledge with container orchestration technologies including Docker and Kubernetes
      • 6+ years experience in an Operations or Systems Administration role
      • Superior Unix administration skills
      • Extensive knowledge of common Internet Protocols
      • Extensive knowledge of TCP/IP
      • Experience with virtualization and cloud technologies
      • Hardware management, network switch and router administration experience
      • Experience with Apache, MySQL, and PHP in a production environment at scale
      • Strong knowledge of version control systems and hands-on experience with Git
      • Experience with writing code around infrastructure automation
      • Understanding of how to architect and implement highly available, scalable, and secure network in multiple cloud environments
      • Strong affinity and experience in working with continuous deployment and continuous integration environments
      • An understanding around micro-service architectures and the complexities around their deployments 
      • Extensive programming experience in PHP, Ruby, Python, and Shell
      • Full stack troubleshooting and instrumentation experience
      • Extensive experience with IT automation technologies like Puppet, Salt, Chef, or Ansible
      • Experience with data aggregation, alerting, and reporting and supporting technologies such as Sensu and Graphite

      Nice to haves:
      • Experience in an on-call rotation
      • Experience with Elastic Search or Apache Solr
      • Experience with Spinnaker and/or other CI/CD tools
      • Previous experience as a mentor or advisor
      • Current contributor to open source projects (a Github account you can link us to would be ideal)
      We are an Equal Opportunity, Affirmative Action employer. Minorities, women, veterans and individuals with disabilities are encouraged to apply.

      Benefits and Perks:

      Beyond a stellar work environment, friendly people, and inspiring, innovative work, we have some great benefits and perks:
      Competitive salariesExcellent medical, dental and vision coverage for you and your family, along with other benefit plans like 401(k) matchUnlimited Paid Time OffWellness Reimbursement ProgramOnsite Programs, depending on location, such as Dry Cleaning, Car Washes, Massage, Yoga, and moreCareer & Personal Development Program – multi-platformRegular social eventsOwnership is the greatest self-identity at SugarCRM - you are making an impact nowWe are a merit-based company - many opportunities to learn, excel and grow your career
    • We have proudly been recognized by INC Magazine with the Best Workplaces award two years in a row! Read more about this achievement and what makes us special here: https://www.invoca.com/press/invoca-recognized-as-an-inc-magazine-best-workplace-for-the-second-consecutive-year/

      Commitment to our customers, collaboration, and continuous improvement in a positive environment are more than words written on a wall at Invoca, it’s our way of life. We take pride in an inclusive and egoless culture that helps us drive innovation and build value for both our customers and our people. And of course, there’s the competitive pay, great perks, and getting to work on an industry-leading product. If this sounds unlike most tech jobs you’ve had, you’re right. Come join us. We’re building something special.

      Job Description

      Invoca offers an unusually valuable engineering experience. You will be part of a team of world-class Engineers scaling our Information Security program with our rapidly growing company and SaaS application. Innovating new and creative ways to secure our platform and people. Our remote-first team is committed to upholding high standards via modern methodologies of agile software development, test-driven development, and DevOps.

      What you will do: 
      • Work with SRE team to deploy and maintain security tools for the organization
      • Execute web application and network penetration testing
      • Oversee vulnerability assessment platform
      • Innovate and assist in improving our Information Security framework
      • Assist with security awareness training (phishing tests, annual awareness, etc)
      What we are looking for:
      • A minimum of three years of experience in an Information Security team
      • A systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
      • Extensive experience within a production environment performing penetration tests on both systems and web applications
      • Knowledge of Docker, Kubernetes, and the challenges/benefits associated with containerization.
      • Hands-on experience with Configuration Management (e.g. Chef, Ansible, Puppet) and/or Infrastructure as Code (e.g. Terraform, CloudFormation).
      • A desire to create and write elegant, scalable, and maintainable tools and solutions.
       

       

      What is the team like?

      You’ll join a team where everyone, including you, is striving to constantly improve their knowledge of software development tools, practices, and processes. We are an incredibly supportive team. We swarm when problems arise and give great feedback to help each other grow. Working on our close-knit, cross-functional teams is a great chance to grow your knowledge of different domains from databases to front ends to telephony and everything in between.

       

      We are passionate about many things: continuous improvement, working at a brisk but sustainable pace, writing resilient code, maintaining production reliability, paying down technical debt, hiring fantastic teammates; and we love to share these passions with each other.

      Learn more about the Invoca development team on our blog and check out our open source projects.

       
      Diversity and inclusion statement

      “Our company is committed to creating a culture that is not only grounded in continuous learning, teamwork, and customer success, but is fair, equitable, and welcoming for everyone.” Gregg Johnson CEO

      And to us, diversity and inclusion means even more than treating current employees well and making them feel welcome. It also means proactively hiring people who bring different insights because of their unique demographics, ways of thinking, and prior experiences.

      We intend to continue hiring great people and protecting our culture so everyone can be themselves and speak their minds. That way Invoca will always be a place filled with laughter, energy, hard work, thoughtfulness and respect.

      We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

    • Doximity is transforming the healthcare industry. Our mission is to help doctors be more productive, informed, and connected. As a software engineer, you'll work within cross-functional delivery teams alongside other engineers, designers, and product managers in building software to help improve healthcare.  

      Our team brings a diverse set of technical and cultural backgrounds and we like to think pragmatically in choosing the tools most appropriate for the job at hand.

      About Us

      Here's How You Will Make an Impact

      • Improve the performance and scalability of services, optimize our REST and GraphQL APIs
      • Address security concerns and proficiently maintain our application stack
      • Troubleshoot issues across the whole stack, such as high-load, memory full, network issues and come up with temporary/long term solutions based on the root cause
      • Hands-on maintenance on our Ruby on Rails and Go (Golang) applications
      • Increase our automated test coverage and deployment infrastructure robustness 
      • Manage infrastructure using Chef and Terraform
      • Active involvement in design, implementation, and maintenance of the development, staging, and production infrastructure and services your team is responsible for
      • Create concise postmortems in the event of an outage
      • Write and maintain run-books for other engineers to leverage
      • Ensure proper security, monitoring, alerting, and reporting for the applications your team is responsible for
      • Collaborate with other engineers to make sound infrastructure decisions, improve workflow, and deploy applications ready for production
      • Monitor capacity, cost and plan for upgrades
      • Participate in an on-call rotation

      About you

      • You are a Ruby engineer at heart, very familiar and passionate about the Rails ecosystem
      • You are knowledgeable of memory and CPU profiling tools to help adjust Ruby jobs and processes to use resources effectively
      • You have experience working with Terraform and Chef (or similar tooling) either in a DevOps or product support capacity
      • You have experience deploying, configuring, and maintaining NGINX
      • You are proficient with Unix, AWS, and Git
      • You are self-motivated and able to manage yourself and your own queue
      • You are a problem solver with a passion for simple, clean, and maintainable solutions
      • You agree that concise and effective written and verbal communication is a must for a successful team
      • You are able to maintain a minimum of 5 hours overlap with 9:30 to 5:30 PM Pacific time
      • You can dedicate about two weeks per year for travel to company events

      Benefits & Perks

      • Generous time off policy
      • Comprehensive benefits including medical, vision, dental, Life/ADD, 401k, flex spending accounts, commuter benefits, equipment budget, and continuous education budget
      • Pre-IPO stock incentives
      • .. and much more! For a full list, see our career page

      More info on Doximity

      We’re thrilled to be named the Fastest Growing Company in the Bay Area, and one of Fast Company’s Most Innovative Companies. Joining Doximity means being part of an incredibly talented and humble team. We work on amazing products that over 70% of US doctors (and over one million healthcare professionals) use to make their busy lives a little easier. We’re driven by the goal of improving inefficiencies in our $3.5 trillion U.S. healthcare system and love creating technology that has a real, meaningful impact on people’s lives. To learn more about our team, culture, and users, check out our careers page, company blog, and engineering blog. We’re growing steadily, and there’s plenty of opportunity for you to make an impact.

      Doximity is proud to be an equal opportunity employer, and committed to providing employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law.

  • Product (1) Product rss feed

    • Auth0 is a pre-IPO unicorn. We are growing rapidly and looking for exceptional new team members to add to our teams and will help take us to the next level. One team, one score. 

      We never compromise on identity. You should never compromise yours either. We want you to bring your whole self to Auth0. If you’re passionate, practice radical transparency to build trust and respect, and thrive when you’re collaborating, experimenting and learning – this may be your ideal work environment.  We are looking for team members that want to help us build upon what we have accomplished so far and make it better every day.  N+1 > N.

      The Platform Engineering group at Auth0 builds the underlying technologies that power our Public and Private Cloud Platforms worldwide. The group is responsible for infrastructure, data storage, observability, SRE, provisioning, compute, orchestration platform, build/test/deploy, tools and services - all of the components that make up the Auth0 Platform.

      We’re looking for a technically savvy product manager to deliver solutions that empower customers to realize the value of the platform. This is a high-impact role that requires you to understand the business challenges and use-cases of our auth0 customers and developers to shape the product roadmap. Experience in cloud (AWS) infrastructure, storage, security, containerization/kubernetes, and CI/CD pipeline is highly desirable. A fierce curiosity and strong collaboration skills are your keys to success.

      You will:

        • Conduct product research and discovery with engineering teams.
        • Analyze and synthesize signals from multiple sources: users, field teams, market data, competitive analysis, and others.
        • Define the near and long-term strategy and socialize it with stakeholders.
        • Work daily as a member of a dedicated team with engineering and design, organized around a shared mission.
        • Develop and test product hypotheses, working in a lean and iterative way.
        • Define and track KPIs and success metrics for your product area.
        • Work with cross-functional partners in Product Marketing and our field teams to enable successful feature launches.
        • Create assets to guide product development work (framing documents, user story maps, opportunity canvases, stories for implementation).

      You might be a good fit for this role if you:

        • Have 5+ years of experience as a product manager, preferably in a startup environment, with a minimum of 5 years working in the software industry.
        • Has experience working on technology platforms, experience in building highly available and scalable large web software backends in cloud environments (Preferably AWS).
        • Has experience with microservice architecture and web application/services development.
        • Has experience working with DevOps teams, SRE teams and managing infrastructure running business-critical applications.
        • Has prior experience in working with one or more domains like SQL/NoSQL databases, full-stack web deployments, SaaS or PaaS deployments.
        • Has experience working with containers, kubernetes, container orchestrators, and cloud deployments.
        • Are a champion for collaborative, iterative product discovery, and embrace your role as a member of a cross-disciplinary team.

        • Have exposure to lightweight product development methods such as user story mapping or rapid prototyping.

        • Are curious about new technology and exhibit a strong desire to learn.

        • Have a degree of technical fluency that allows you to communicate with and understand your target audience (developers).

        • Love the work of identifying and deeply understanding customer problems.

        • Exhibit user empathy and seek their input at all stages of the product life cycle.

        • Are self-motivated and have experience working remotely.

        • Can travel domestically or internationally when required (15% or less).

        • Have experience working on Platform is a plus.







      Preferred Locations:




        • #US; #AR; #CA;






      Auth0’s mission is to help developers innovate faster. Every company is becoming a software company and developers are at the center of this shift. They need better tools and building blocks so they can stay focused on innovating. One of these building blocks is identity: authentication and authorization. That’s what we do. Our platform handles 2.5B logins per month for thousands of customers around the world. From indie makers to Fortune 500 companies, we can handle any use case.

      We like to think that we are helping make the internet safer.  We have raised $210M to date and are growing quickly. Our team is spread across more than 35 countries and we are proud to continually be recognized as a great place to work. Culture is critical to us, and we are transparent about our vision and principles. 

      Join us on this journey to make developers more productive while making the internet safer!
  • All others (1) All others rss feed

    • Together we’re building a company that will endure and products people will love for generations to come. 

      We believe that people do their best in a culture that fosters inclusion, innovation, and success. Our values - Champion the Customer, Take the Lead, Run Together, Ack + Own and Bring Yourself - serve as the foundation of our collaborative and dynamic culture. 

      Whether it’s conducting a retrospective, participating in our monthly Hackdays, cranking out a new product feature, supporting our two PagerDuty bands, or doing our day to day work, Dutonians live and breathe these five values every day. Together, we solve real customer issues and fulfill our mission of connecting teams to real-time opportunities and elevate work to the outcomes that matter.

      Solve for what’s next—at PagerDuty.

      The PagerDuty Expert Services team is focused on enabling our customers to most effectively leverage our platform to achieve their business goals. We partner with our key customers to provide large scale onboarding; custom integrations, service modeling, and provision users, teams, services, schedules, and escalation policies.

      As the company introduces a new approach to services delivery in a rapidly growing startup, this role will be instrumental in developing the process and technologies to deliver amazing customer experiences. You will help establish methodologies and repeatable processes to deliver successful implementations, every time. 

      About You
      • You’ve got technical chops. You are a technologist first. You demonstrate a deep knowledge of IT monitoring tools or within DevOps, SRE or IT Operations. You run the implementation process from design to delivery. You partner with customers to help design and build integrations to provide awesome implementations.
      • You are a problem solver. You identify potential roadblocks and provide thoughtful solutions. You are excellent at multi-tasking, are self-driven, and can work both independently and with a cross-functional team. You come up to speed quickly, love to learn, have a strong working style and impeccable attention to detail. You are comfortable running multiple simultaneous customer engagements and able to manage multiple threads within those engagements
      • You are an excellent and compelling communicator. You can break down complex technical concepts and explain them clearly to partners from business and technical backgrounds, from a DevOps engineer up to a C Level Executive. You have experience implementing technology solutions in the SaaS world and can articulate the solution to all levels in the customer organization
      • You are an extraordinary partner –  to sales, to product, to your team, to your customers. Depending on the situation, you play the part of project manager, architect, consultant, technical guru, product expert, leader, evangelist, and teacher, with a relentless commitment to outstanding customer service
      Ideal Qualifications
      • 5+ years of hands-on technical background with a primary emphasis on IT Operations / Professional Services delivery
      • Demonstrated Python and Javascript experience, especially within an AWS Lambda and stand-alone automation, scripting and tooling context
      • Demonstrated knowledge and ability to interact with common SaaS and traditional software APIs (REST, SOAP, WS), webhooks, etc. as part of scripting and tooling development, integration development, and ETL like activities.
      • Knowledge of infrastructure as code and DevOps SRE toolchains (GitHub, Terraform, Chef, Artifactory, JFrog, Nomad, Consul, Vault)
      • Ability to do advanced scripting (Python, Javascript, Go, Ruby, Perl) and fundamental knowledge of Linux. 
      • Experience with node.js, css, flexbox & bootstrap
      • Hands-on technical background using AWS (EC2, Lambda, S3, RDS, API Gateway, DynamoDB, IAM)
      • Deep technical knowledge with ITSM tools like ServiceNow, Jira, Remedy (ServiceNow Admin, ServiceNow Scripting, ServiceNow GScript/Rhino, Studio) 
      • Understanding of monitoring systems (DataDog, Dynatrace, Nagios, New Relic, Splunk, Zabbix)
      • You know and understand our space (or you’re already a fan of our product!).
      • Be prepared to give us a demo and show us what you've got!
      Please note: this position may be either remote or based in our Atlanta offices. The role will involve 25-50% travel

      PagerDuty offers:
      Competitive salaries and company equity
      Comprehensive benefits package including: medical, dental, and vision plans for you, your spouse and family
      401K with 1% match
      Pre-tax commuter benefits, FSA, cell phone allowance and more!
      Generous parental leave
      Paid vacation (3 weeks vacation your first year, 4 weeks afterwards) in addition to 12 paid holidays and ample sick leave
      Paid employee Volunteer Time - 20 hours per year
      Monthly company wide hack days

      PagerDuty is committed to creating a diverse environment and is an equal opportunity employer. PagerDuty does not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, parental status, veteran status, or disability status.

      PagerDuty is committed to providing reasonable accommodations for qualified individuals with disabilities in our job application process.  Should you require accommodation, please email [email protected] and we will work with you to meet your accessibility needs.

      Our stewardship of the data of many thousands of customers means that a background check is required to join PagerDuty. We will, nonetheless, consider for employment qualified applicants with arrest and conviction records in a manner consistent with local requirements.

      PagerDuty uses the E-Verify employment verification program.

      To all recruitment agencies: PagerDuty does not accept agency resumes. Please do not forward resumes to our jobs alias, PagerDuty employees or any other company location. PagerDuty is not responsible for any fees related to unsolicited resumes.