Senior Data Engineer - Data Sourcing

RemoteFull TimeEngineeringExperienced

Who we are:

Revelio Labs provides workforce intelligence. We absorb and standardize hundreds of millions of public employment records to create the world’s first universal HR database, allowing us to see current workforce composition and trends of any company. Our customers include investors, corporate strategists, HR teams, and governments.

What we’re looking for:

Revelio Labs is looking for a data acquisition specialist who has proven expertise in data sourcing, web scraping, and data engineering. You will oversee and own the full lineage of data sources and feeds, from scraping and parsing to ingestion and model integration.

We are looking for someone who can design and implement web crawlers to source public data that will generate powerful insights on the labor market, integrate proprietary models to standardize and augment data sources, and increase the resiliency of existing data streams, as well as build a generalized framework to automate data acquisition, ingestion, and augmentation.

Experience and Skills:

Required:

  • Strong knowledge of Python and TypeScript
  • Experience building large-scale, high-volume web scrapers
  • Experience with bot prevention services and reverse engineering web applications
  • Deep understanding of web technologies, frameworks, and network protocols (HTML, JavaScript, HTTP, etc)

Preferred:

  • Experience building and maintaining ETL/ELT pipelines
  • Experience with cloud monitoring and administration tools
  • Familiarity with big data processing tools (e.g. Spark)
  • Knowledge of data warehouse maintenance best practices, including data wrangling, model integration, anomaly detection, and documentation
  • Knowledge of common CI/CD practices, including continuous build/test/deploy automation

Location:

Our offices are based in New York City, but the position can be done remotely.

Salary:

The pay range for this position in New York City is $120,000 - $200,000 per year. The salary range for performing this role outside of New York City may differ. Base pay offered may vary depending on job-related knowledge, skills, and experience. Additionally, you may be eligible to participate in our company’s equity program, plus benefits, including medical, dental, vision, retirement, and other. The range above is for the expectations as laid out in the job description, however we are often open to a wide variety of profiles, and recognize that the person we hire may be more senior or have different experience than this job description as posted. If that ends up being the case, the updated salary range will be communicated to you as a candidate.

Why you should work with us:

We are putting public data to incredible use, providing unparalleled insight into the workforces of companies and industries. You’ll work with a strong and innovative team of engineers who have created the most comprehensive set of labor market data available, and who will continue to discover valuable information in all corners of the public web.

How you should reach us:

Please email your resume to recruiting@reveliolabs.com as a PDF file. Please include your GitHub and highlight any projects that you’ve worked on that may be relevant.

Please find our CPRA Job Applicant Privacy Notice here.

Apply

Please find our CPRA Job Applicant Privacy Notice here.