Data Scientist

SimonData


4 months ago

06/30/2019 10:21:23

Job type: Full-time

Hiring from: US only

Category: All others


ABOUT US

Simon Data was founded in 2015 by a team of successful serial entrepreneurs. We're a data-first marketing platform startup, and we approach our work seriously; we tackle problems in a scrappy and disruptive fashion, yet we build for scale to support our clients at big data volume.

We are the first and only enterprise customer data platform with a fully-integrated marketing cloud. Moving beyond the limitations of both categories, Simon’s platform empowers businesses to leverage enterprise-scale big data and machine learning to power customer communications in any channel. Simon’s unique approach allows brands to develop incredible personalization capabilities without needing to build and maintain massive bespoke data infrastructure.

Our culture is rooted in organizational transparency, empowering individuals, and an attitude of getting things done. If you want to be a valuable contributor on a team that cultivates these core values we would love to hear from you.

THE ROLE

As a Data Scientist at Simon, you will be working as part of a collaborative/user focused team and be responsible for designing and building smart systems that drive revenue—our statistical models are at the core of our product, and will only become more so as we continue to develop and add features.  We take an approach to ML that is data-first, and requires principled modeling decisions: we don’t believe in theory-crafting models before we have collected the data that will power them, as well as built out the business process that will continue to generate that data. In the model building process, we prioritize interpretable models whose training and performance yield insights about the underlying process, along with optimizing the selected objective.

Our technologies of choice are Python in the backend and React/Redux in the frontend, and our tech stack includes Django, MySQL, Redshift, S3, DynamoDB, and Elasticsearch storage, asynchronous tasks over RabbitMQ, and distributed data processing over Elastic MapReduce and Spark.

WHAT YOU’LL DO

  • Build ML products that leverage Simon’s extraordinary data access to drive real business value

  • Build high-quality statistical models by executing the entire model-building process, including data cleaning, feature extraction, model selection, and predictive validation

  • Contribute to the tooling and interfaces used to support the data science process at Simon

  • Represent Simon DS in conversations with stakeholders at our client companies

  • Advance Simon as a thought leader in data science, by writing blog posts and papers, and presenting at industry conferences

  • Guide internal product and technology strategy by representing data science perspectives and requirements in conversations with your peers

QUALIFICATIONS

  • Ph.D. in Statistics/Machine Learning, or equivalent

  • Excellent communication of statistical concepts to expert & non-expert audiences

  • Broad and up-to-date knowledge of machine learning models (and their performance characteristics) for classification and regression tasks

  • Specific experience designing and building machine-learning models

  • Fluency in at least one statistical coding environment (numpy/pandas, R, etc.)

  • Comfort coding in at least one non-statistical language (e.g. Python or Java, not R or Matlab)

  • Fluency in SQL

  • Production-level software engineering experience is a plus

  • Expertise in causal inference, experiment design, reinforcement learning, and related fields is a plus

Visa sponsorship for this role is currently not available.

Please mention that you come from Remotive when applying for this job.

Help us maintain Remotive! If this link is broken, please just click to report dead link!

similar jobs

  • Skylight (https://skylight.digital/) is at the forefront of a civic movement to reinvent how the government serves the public in a digital world.

    We’re looking for a Data Scientist to join our talented team of technologists in driving this movement forward.

    You’ll be a key part of our small, but rapidly growing team, which consists of former Presidential Innovation Fellows, founders of 18F, and members of the U.S. Digital Service.

    We work in small, fast, agile teams to create exceptional customer experiences and enduring solutions out of the government’s most complex design and technology challenges. The work is challenging, but highly rewarding.

    Requirements

    What you’ll do:

    • Identify valuable data sources and automate collection processes

    • Undertake preprocessing of structured and unstructured data

    • Analyze large amounts of information to discover trends and patterns

    • Build predictive models and machine-learning algorithms

    • Combine models through ensemble modeling

    • Present information using data visualization techniques

    • Propose solutions and strategies to business challenges

    • Select and use the right tools, frameworks, languages, and technologies for the job, with a preference for open-source solutions

    • Collaborate with others as part of a cross-functional team that includes user experience researchers and designers, product managers, engineers, and other functional specialists

    • Represent Skylight's culture of delivery when interacting with government stakeholders and other contractors

    What we’re looking for:

    • In-depth knowledge of computer science, statistics, and/or mathematics

    • Experience using advanced statistical and data-mining techniques such as regression, properties of distributions, and statistical tests

    • Expertise in programming languages/frameworks such as Java/Scala, R, Python, and SQL

    • Experience with distributed data/computing tools such as Spark, Hadoop, and Hive

    • Experience using web services such as Redshift and S3

    • Expertise in machine learning and deep-learning libraries such as Scikit-Learn, TensorFlow, Keras, PyTorch, and Apache Spark MLlib

    • Experience using business intelligence tools such as Tableau, SAS, Microstrategy, and Looker

    • Experience working with notebooks such as Jupyter, Zeppelin, and Databricks Notebook

    • Experience visualizing/presenting data to stakeholders using tools such as Matplotlib, Seaborn, and D3

    • Strong analytical and problem-solving skills

    • Strong business acumen and curiosity

    • A drive to learn and master new technologies and techniques

    • Ability to select and use the right tools for the job, particularly open-source solutions

    • Ability to communicate clearly to technical and non-technical audiences

    • Experience working within a multidisciplinary, agile team format

    • A mindset and work approach that aligns with our core values (https://skylight.digital/culture/)

    • Able to travel from time to time

    Benefits

    We focus on supporting you in a variety of ways:

    • Competitive salary

    • Profit-sharing and/or bonus opportunities

    • Health insurance, including medical, dental, vision, and more

    • 401k match at 10% of your salary

    • Unlimited paid time-off policy

    • $2,000 continuing education allowance, including conference events

    • Incentives for living in a HUBZone area (https://maps.certify.sba.gov/hubzone/map), including relocation assistance and a monthly stipend to help offset the cost of rent or mortgage. (Read more about us being a HUBZone: https://skylight.digital/about/#hubzone.)

    • Time to focus on activities such as learning & development, open-source projects, and community outreach

    • An environment that empowers you to unleash your superpowers for public good

    • Note that we participate in E-Verify and upon hire, will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.

  • Yesterday

    The data analyst role at GitLab is a hybrid role: part data analyst, part data scientist, part data warehouse engineer, and part backend engineer.

    This role will require an analytical and business-oriented mindset with the ability to implement rigorous database solutions and best practices in order to produce and influence the adoption of strong quality data insights to drive business decisions in all areas of GitLab.

    Responsibilities

    • Collaborate with other functions across the company by building reports and dashboards with useful analyses and strong data insights

    • Explain trends across data sources, potential opportunities for growth or improvement, and data caveats for descriptive, diagnostic, predictive (including forecasting), and prescriptive data analysis

    • Deep understanding of how data is created and transformed through GitLab products and services provided by third-parties to help drive product designs or service usage or note impacts to data reporting capabilities

    • Understand and document the full lifecycle of data and our common data framework so that all data can be integrated, modeled for easy analysis, and analyzed for data insights

    • Document every action in either issue/MR templates, the handbook, or READMEs so your learnings turn into repeatable actions and then into automation following the GitLab tradition of handbook first!

    • Expand our database with clean data (ready for analysis) by implementing data quality tests while continuously reviewing, optimizing, and refactoring existing data models

    • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment. Maintain and advocate for these standards through code review

    • Provide data modeling expertise to all GitLab teams through code reviews, pairing, and training to help deliver optimal, DRY, and scalable database designs and queries in Snowflake and in Periscope

    • Approve data model changes as a Data Team 

    • Reviewer and code owner for specific database and data model schemas

    • Own the end-to-end process of on-call data triaging from reading Airflow logs, to diagnosing the data issue, and to verifying and implementing a solution with an automated alerting system (ChatOps, etc) as well as providing data support for all GitLab members

    • Contribute to and implement data warehouse and data modeling best practices, keeping reliability, performance, scalability, security, automation, and version control in mind

    • Follow and improve our processes and workflows for maintaining high quality data and reporting while implementing the DataOps philosophy in everything you do

    This position reports to the Manager, Data

    Requirements

    • 2+ years experience in an analytics role

    • Experience building reports and dashboards in a data visualization tool

    • Passionate about data, analytics and automation. Experience cleaning and modeling large quantities of raw, disorganized data (we use dbt)

    • Experience with a variety of data sources. Our data includes Salesforce, Zuora, Zendesk, Marketo, NetSuite, Snowplow and many others (see the data team page)

    • Demonstrate capacity to clearly and concisely communicate complex business logic, technical requirements, and design recommendations through iterative solutions

    • Deep understanding of SQL in analytical data warehouses (we use Snowflake SQL) and in business intelligence tools (we use Periscope)

    • Hands on experience working with SQL, Python, API calls, and JSON, to generate business insights and drive better organizational decision making

    • Familiarity with Git and the command line

    • Deep understanding of relational and non-relational databases, SQL and query optimization techniques, and demonstrated ability to both diagnose and prevent performance problems

    • Effective communication and collaboration skills, including clear status updates

    • Positive and solution-oriented mindset

    • Comfort working in a highly agile, intensely iterative environment

    • Self-motivated and self-managing, with strong organizational skills

    • Ability to thrive in a fully remote organization

    • Share and work in accordance with our values

    • Successful completion of a background check

    Product

    • Support the Product function by spearheading tracking and reporting initiatives

    • Focus on product usage metrics across SaaS and self-managed products

    • Build cross-functional analyses to drive strategic decision-making

    • Priorities will be set by a Director of Product but will collaborate with and report into the Data Team

  • ABOUT US

    Simon Data was founded in 2015 by a team of successful serial entrepreneurs. We're a data-first marketing platform startup, and we approach our work seriously; we tackle problems in a scrappy and disruptive fashion, yet we build for scale to support our clients at big data volume.

    We are the first and only enterprise customer data platform with a fully-integrated marketing cloud. Moving beyond the limitations of both categories, Simon’s platform empowers businesses to leverage enterprise-scale big data and machine learning to power customer communications in any channel. Simon’s unique approach allows brands to develop incredible personalization capabilities without needing to build and maintain massive bespoke data infrastructure.

    Our culture is rooted in organizational transparency, empowering individuals, and an attitude of getting things done. If you want to be a valuable contributor on a team that cultivates these core values we would love to hear from you.

    THE ROLE

    Does your perfect job involve leading a team of hardworking data scientists responsible for the ML models driving core functionality? As well as one in which success in your role will be essential to the company? Simon is looking for a thoughtful data science manager to partner with all parts of the business and do just that.

    As a successful leader in this role, you'll use your intimacy with the business' goals to help structure sprint team memberships quarter over quarter as well as manage programs. Drawing upon years of experience as a data scientist, you will collaborate individually with your team members to develop their statistical and engineering skills. You will act as a coach, guide, and recruiter as you build a team of influential data scientists who are role models to their peers in both exemplifying our values & competencies as well as consistently delivering high impact business value.

    We move quickly (multiple deploys throughout the day) and constantly work to reduce waste as well as structure ourselves around the most important goals for the business. Our data science team currently comprises roughly 7 data scientists of diverse backgrounds and skill sets.

    WHAT YOU'LL DO

    • Lead a multi-functional team of junior to staff data scientists to help them grow the next generation of the company’s platform

    • Contribute to building and delivering resource allocations targeted to continue driving the engineering organization forward

    • Drive growth of the team by playing a pivotal role in finding, attracting and hiring talented individuals to Simon

    • Collaborate with the leadership team to produce plans for team personal development, growth, and sustained value

    • Lead critical processes, scale our architecture, or lead design decisions

    QUALIFICATIONS

    • 3+ years of management experience

    • 3+ years of engineering experience as a data scientist or data engineer & comfort with at least one mainstream programming language (Python, Java, Scala, C#, Ruby, etc.), and one statistical programming environment (Jupyter/pandas, Tensorflow, RStudio, etc.)

    • M.S. in a statistical/quantitative field, or equivalent professional experience

    • Proven ability to successfully lead teams & projects

    • Reasonable exposure to & understanding of some form of Agile or modern program management

    • Strong communication skills

    • A fervent appreciation for organization

    • Thrive in a fast-paced environment

    • A passion for working with data

    • Thoughtful, curious and a problem solver

    • Personable, collaborative, and a sense of humor

    Diversity

    We’re proud to be an equal opportunity employer open to all qualified applicants regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or expression, Veteran status, or any other legally protected status.

Remotive can help!

Not sure how to apply properly to this job? Watch our live webinar « 3 Mistakes to Avoid When Looking For A Remote Startup Job (And What To Do Instead) ».

Interested to chat with Remote workers? Join our community!