Data Researcher

Full-time
New York City, United States

Cybersyn is a new DaaS (data-as-a-service) company backed by Sequoia, Coatue, and Snowflake. Our mission is to make the world's economic data transparent to governments, businesses, and entrepreneurs, and to enable a new generation of decision makers. We acquire unique data assets (companies, licenses, data rights, consumer dividends) and build derived products on top of them, focusing on measuring what consumers and businesses are spending money on. You can think of Cybersyn as a cross between an investment firm and a technology company focused on data. The reward is great: if we are successful, we will disrupt the likes of Nielsen and S&P, an industry worth hundreds of billions of dollars, and build SimCity for the real world.


We have already released a number of public datasets on the Snowflake Marketplace that we have cleaned, restructured, and made joinable.

  • See our current data here: https://app.snowflake.com/marketplace/listings/Cybersyn%2C%20Inc
  • Demo our data here: https://cybersyn-datacommons.streamlit.app/


Who you are:

  • Pragmatic and commercially minded data researcher, more interested in getting to actionable insights than in methodology or technology for its own sake
  • Experience working with multiple (external) datasets, including cleaning, joining, and munging data; experience working with public data sources (e.g., US Census, ACS) is a huge plus
  • Experience in Python (or R) and SQL is required; ideally, you have worked with cloud data warehouses before (Snowflake, BigQuery, Redshift, etc.)
  • Familiarity with basic statistical concepts (e.g., linear regression)
  • Experience with dbt, AWS, and GitHub is very useful but not strictly required
  • Articulate, pragmatic, and scrappy; must be willing to wear multiple hats and change context frequently


What you will do:

  • Work with customers, data sources, and the technical team to create derived data products that answer business questions; in practice, this means building data pipelines out of SQL, Python, dbt, and orchestration tools
  • Build prototypes of entity resolution, data normalization, and statistical poststratification algorithms, and oversee the implementation of those prototypes
  • Build data visualizations using Vega-Lite and Streamlit to demonstrate data products


What you get out of it:

  • Ability to shape Cybersyn’s initial product and technology decisions
  • Access to some of the most interesting economic data in the world, including real-time spending, transaction, and clickstream data from both third-party and first-party sources. Much of our data is not available to any other third parties
  • Fast-moving culture, with lots of responsibility and autonomy from day 1
  • Our system is built with heterogeneous data sources in mind: we are not working on data from a single product or theme, but on data from sources ranging from governments and payment processing systems to mobile devices


Compensation:

  • We offer both salary & equity options
  • Total compensation: $100-300k

Location:

  • New York City, full-time
  • No remote