earthmover transforms collections of tabular source data (flat files, FTP files, database tables/queries) into text-based (JSONL, XML) data via YAML configuration.
Quick-start
-
Install
earthmoverwith -
Create an
earthmover.ymlconfiguration file that defines your project config, data sources, transformations, and destinations: -
Create the
./json_templates/mydata.jsonttemplate file (which may use Jinja) to use when rendering the data for yourmydatadestination: -
Now run earthmover
and look for the output fileoutput/mydata.json.
How it works
earthmover is similar to dbt, though it executes data transformations locally using dataframes (rather than in a SQL engine). Like dbt, it creates a data dependency DAG from your earthmover.yml configuration and materializes output data in dependency-order.
Read more
Above is a simple quick-start example. Please read the documentation for more information about earthmover's many features, including:
- Resources for understanding
earthmover - How to install
earthmover - How to configure an
earthmoverproject - How to use
earthmover's commands and features - Best practices around
earthmoverprojects - Details about the design of
earthmover - How to contribute to
earthmover