Data Engineer

Legalist


3 months ago

10/04/2019 10:21:23

Job type: Full-time

Hiring from: US & Europe

Salary: $80k – $100k

Category: All others


Legalist is breaking new ground in FinTech and LegalTech. Data Science is one of the pillars of Legalist's continuous innovation, and we're looking for someone who can lead the charge on that front.

You will get to..

  • Use Python, PyCharm, Jupyter to build our products

  • Use AWS, GCP, Kubernetes, Docker, Jenkins to scale our infrastructure

  • Learn at the bleeding edge of web and machine learning technologies

  • Work with a ton of legal data to build analytical tools that support the business team

Legalist is an investment firm that uses tech to invest in lawsuits. We graduated YCombinator as part of their Summer 2016 batch, and have since garnered international press for our pioneering work. You can read about us in NYT, WSJ, The Guardian, Le Monde, The Economist, and many others.

If you're interested in the intersection of finance, technology, and law, then you'll find the problems we work on highly interesting. We scrape millions of court records and build technology that streamlines the process of investing in legal assets, while running investment funds that generate high returns for our investors.

YOUR MISSION

Ideally you will be interested in learning, be proactive, and enjoy using bleeding edge technologies. Formal 'experience' not necessary but demonstration of capability required. It is a well paid role, with salary based on capability. You will be working alongside the CTO & co-founder and 4 talented engineers.

Currently, our platform is based around a backend microservices architecture for different use cases.

We're really looking for people who would love to join a fast growing startup with great financial projections, paying clients, strong investors, and an awesome team where you can always be learning. Our work is multi-disciplinary, and we're looking for engineers with an interest in business as well.

RESPONSIBILITIES

  • Autonomy over a core product and data pipeline

  • Collaborate with engineers to develop and ship features

  • Write efficient, modular, and reusable libraries and abstractions

  • Identify key drivers & insights to improve our analytics engines

  • Participate in code reviews

QUALIFICATIONS

Applicants are not expected to show advanced understanding of all of the below, but must show willingness, ability, and interest in keeping up with cutting edge technologies and frameworks.

  • 4+ years of experience with machine learning and data science techniques

  • Degree in Computer Science, Statistics, Mathematics or equivalent field

  • Ability to implement best practices

  • Ability to identify key insights and technologies

  • Comfort with independently building out MVPs which can then be built out and supported by the engineering team

  • Experience working with modern data stores such as NoSQL/Postgres, S3, Cassandra or similar

  • Experience working with Cloud Computing technologies (e.g. AWS, Azure, GCP)

  • Ability to communicate technical specifications both verbal and written

Please mention that you come from Remotive when applying for this job.

Help us maintain Remotive! If this link is broken, please just click to report dead link!

similar jobs

  • Kalepa is looking for Data Scientists to lead efforts at the intersection of machine learning and big data engineering in order to solve some of the biggest problems in commercial insurance.

    Data scientists at Kalepa will be turning vast amounts of structured and unstructured data from many sources (web data, geolocation, satellite imaging, etc.) into novel insights about behavior and risk. You will be working closely with a small team in designing, building, and deploying machine learning models to tackle our customers’ questions.

    Kalepa is a New York based, VC backed, startup building software to transform and disrupt commercial insurance. Nearly one trillion ($1T) dollars are spent globally each year on commercial insurance across small, medium, and large enterprises. However, the process for estimating the risk associated with a given business across various perils (e.g. fire, injury, malpractice) is still reliant on inefficient and inaccurate manual forms or outdated and sparse databases. This information asymmetry leads to a broken set of economic incentives and a poor experience for both businesses and insurers alike. By combining cutting edge data science, enterprise software, and insurance expertise, Kalepa is delivering precision underwriting at scale – empowering every commercial insurance underwriter to be as effective and efficient as possible. Kalepa is turning real-world data into a complete understanding of risk.

    Kalepa is led by a strong team with experiences from Facebook, APT (acquired by Mastercard for $600M in 2015), the Israel Defense Forces, MIT, Berkeley, and UPenn.

    About you:

    ● You want to design a flexible analytics, data science, and AI framework to transform the insurance industry

    ● You have demonstrated success in delivering analytical projects, including structuring and conducting analyses to generate business insights and recommendations

    ● You have in-depth understanding of applied machine learning algorithms and statistics

    ● You are experienced in Python and its major data science libraries, and have deployed models and algorithms in production

    ● You have a good understanding of SQL and non-SQL databases

    ● You value open, frank, and respectful communication

    ● You are a proactive and collaborative problem solver with a “can do” attitude

    ● You have a sincere interest in working at a startup and scaling with the company as we grow

    As a plus:

    • You have experience in NLP and/or computer vision

    • You have familiarity with Spark, Hadoop, or Scala

    • You have experience working with AWS tools

    What you’ll get

    ● Work with an ambitious, smart, and fun team to transform a $1T global industry

    ● Ground floor opportunity – opportunity to build the foundations for the product, team, and culture alongside the founding team

    ● Wide-ranging intellectual challenges working with large and diverse data sets, as well as with a modern technology stack

    ● Competitive compensation package with a significant equity component

    ● Full benefits package, including excellent medical, dental, and vision insurance

    ● Unlimited vacation and flexible remote work policies

    ● Continuing education credits and a healthy living / gym monthly stipend

    [IMPORTANT NOTE]: Salary ranges are for New York based employees. Compensation for remote roles will be adjusted according to the cost of living and market in the specific geography.

  • Auth0 (US or Argentina)
    1 month ago
    Auth0 is a pre-IPO unicorn. We are growing rapidly and looking for exceptional new team members to add to our teams and will help take us to the next level. One team, one score. 

    We never compromise on identity. You should never compromise yours either. We want you to bring your whole self to Auth0. If you’re passionate, practice radical transparency to build trust and respect, and thrive when you’re collaborating, experimenting and learning – this may be your ideal work environment.  We are looking for team members that want to help us build upon what we have accomplished so far and make it better every day.  N+1 > N.

    The Data engineer will help build, scale and maintain the enterprise data warehouse. The ideal candidate will have a deep understanding of technical and functional designs for Databases, Data Warehousing and Reporting areas. The candidate should feed on challenges and love to be hands on with recent technologies.

    This job plays a key role in data infrastructure, analytics projects, and systems design and development. You should be passionate for continuous learning, experimenting, applying and contributing towards cutting edge open source Data technologies and software paradigms.

    Responsibilities:

    • Contributing at a senior-level to the data warehouse design and data preparation by implementing a solid, robust, extensible design that supports key business flows.
    • Performing all of the necessary data transformations to populate data into a warehouse table structure that is optimized for reporting.
    • Establishing efficient design and programming patterns for engineers as well as for non-technical peoples.
    • Designing, integrating and documenting technical components for seamless data extraction and analysis.
    • Ensuring best practices that can be adopted in our data systems and share across teams.
    • Contributing to innovations and data insights that fuel Auth0’s mission.
    • Working in a team environment, interact with multiple groups on a daily basis (very strong communication skills).

    Skills and Abilities:

    • + BA/BS in Computer Science, related technical field or equivalent practical experience.
    • At least 4 years of relevant work experience
    • Ability to write, analyze, and debug SQL queries.
    • Exceptional Problem solving and analytical skills.
    • Experience with Data Warehouse design, ETL (Extraction, Transformation & Load), architecting efficient software designs for DW platform.
    • Knowledge of database modeling and design in a Data Warehousing context
    • Strong familiarity with data warehouse best practices.
    • Proficiency in Python and/or R.


    Preferred Locations:

    • #AR; #US;
    Auth0’s mission is to help developers innovate faster. Every company is becoming a software company and developers are at the center of this shift. They need better tools and building blocks so they can stay focused on innovating. One of these building blocks is identity: authentication and authorization. That’s what we do. Our platform handles 2.5B logins per month for thousands of customers around the world. From indie makers to Fortune 500 companies, we can handle any use case.

    We like to think that we are helping make the internet safer.  We have raised $210M to date and are growing quickly. Our team is spread across more than 35 countries and we are proud to continually be recognized as a great place to work. Culture is critical to us, and we are transparent about our vision and principles. 

    Join us on this journey to make developers more productive while making the internet safer!
  • 1 month ago

    Summary 

    Wikipedia is where the world turns to understand almost any topic — The Wikimedia Foundation is the nonprofit that operates Wikipedia with a small staff.  We are looking for a great data architect who wants to modernize the infrastructure underlying Wikipedia with distributed storage, services and REST interfaces.  If this excites you, we welcome you to join us.

    Description

    • Collaborate with Product Owners, Engineers and stakeholders on product discovery and improvements of our existing systems
    • Design and implement effective data storage solutions and models
    • Articulate the flow of data across our diverse range of systems
    • Ensure reusable clear service design and  documentation
    • Defining and aligning the forms and sources of data to facilitate WMF initiatives
    • Ensure monitoring system performance and identify, define and implement internal process improvements and SLOs
    • Work with Site Reliability and Operations Engineers to analyse and determine service discoverability, capacity plans and high availability
    • Recommend solutions to improve new and existing data storage and delivery systems
    • Change the world for more than half a billion people every month ;) 

    Skills and Experience

    • 3+ years experience in a Data Architect role as part of a team
    • You have a track record of leading data architecture initiatives to completion
    • You have experience analysing, reasoning about, optimising and implementing complex data systems
    • You have expertise in data handling approaches and technologies with good understanding of system development lifecycles and modern data architectures(Data Lakes, Data Warehouse)
    • You are comfortable modeling complex systems using approaches such as Domain Driven Design, eventual consistency, stream processing
    • You have experience with a diverse set of data storage and persistence frameworks and have a strong understanding of core data modelling concepts:
      • Relational & distributed databases (e.g. MySQL, Cassandra, Neo4j, Riak, HBase, DynamoDB, Elasticsearch)
      • Consistency trade-offs and transactional algorithms in distributed systems
      • Principles of fault tolerance and robustness
    • Use the best available tools & languages for each task. Currently we work a lot with Node.js but also use other tools and languages like Go, Python, Java, C, C++ and PHP where it makes sense. 
    • You have experience working with data streaming and pipelining systems(Hadoop, Kafka, Druid)
    • You have experience working with an engineering team, and communicate effectively with other stakeholders.
    • You have a track record of combining a solid long-term architectural strategy with short-term progress.
    • With freedom comes responsibility. You direct your own work and are pro-active in asking for input.
    • You have a scientific mindset and empirically test your hypotheses.
    • BS, MS, or PhD in Computer Science or equivalent work experience

    Pluses

    • Experience working with microservice architectures
    • Experience with open source technology and free culture, and have contributed to open source projects
    • Experience working remotely
    • You know what it means to be a volunteer or to coordinate the work of volunteers
    • Big ups if you are a contributor to Wikipedia
    • Please provide us with information you feel would be useful to us in gaining a better understanding of your technical background and accomplishments

    Show us your stuff! If you have any existing open source software that you've developed (these could be your own software or patches to other packages), please share the URLs for the source. Links to GitHub, etc. are exceptionally useful. 

    The Wikimedia Foundation is... 

    ...the nonprofit organization that hosts and operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge, free of interference. We host the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive. The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive financial support from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.

    The Wikimedia Foundation is an equal opportunity employer, and we encourage people with a diverse range of backgrounds to apply.

    U.S. Benefits & Perks*

    • Fully paid medical, dental and vision coverage for employees and their eligible families (yes, fully paid premiums!)
    • The Wellness Program provides reimbursement for mind, body and soul activities such as fitness memberships, baby sitting, continuing education and much more
    • The 401(k) retirement plan offers matched contributions at 4% of annual salary
    • Flexible and generous time off - vacation, sick and volunteer days, plus 19 paid holidays - including the last week of the year.
    • Family friendly! 100% paid new parent leave for seven weeks plus an additional five weeks for pregnancy, flexible options to phase back in after leave, fully equipped lactation room.
    • For those emergency moments - long and short term disability, life insurance (2x salary) and an employee assistance program
    • Pre-tax savings plans for health care, child care, elder care, public transportation and parking expenses
    • Telecommuting and flexible work schedules available
    • Appropriate fuel for thinking and coding (aka, a pantry full of treats) and monthly massages to help staff relax
    • Great colleagues - diverse staff and contractors speaking dozens of languages from around the world, fantastic intellectual discourse, mission-driven and intensely passionate people

    *Eligible international workers' benefits are specific to their location and dependent on their employer of record

    More information

    Wikimedia Foundation
    Blog
    Wikimedia 2030
    Wikimedia Medium Term Plan
    Diversity and inclusion information for Wikimedia workers, by the numbers
    Wikimania 2019
    Annual Report - 2017 

    This is Wikimedia Foundation 
    Facts Matter
    Our Projects
    Fundraising Report

Remotive can help!

Not sure how to apply properly to this job? Watch our live webinar « 3 Mistakes to Avoid When Looking For A Remote Startup Job (And What To Do Instead) ».

Interested to chat with Remote workers? Join our community!