Managed Data-as-a-Service

Fully-managed Data-as-a-Service platform that allows your data team to move faster. Get high-quality operational data and insights in ready-to-use formats.

Available Datasets

Our data engineering team constructs and maintains retail-focused data models using data from numerous Amazon APIs, the Amazon website, and Walmart.
This allows us to provide you with ready-to-use datasets.
When you access your DataHawk database, you'll notice that each theme is represented by a distinct schema.

Product - All your tracked products' data

All the tables included in the product schema will allow you to monitor and benchmark product listings and performance changes.

SEO - All your tracked keywords' data

Access all historical keywords' rank data and optimize products' organic and sponsored search performance. Also, access the historical keyword search volume of your tracked keywords.

Finance - Sales and profit & loss data

All your orders are in one place from all your marketplaces. One beautiful and meticulously unified model for Amazon and merchant-fulfilled orders.
In addition to orders and sales, access Profit & Loss and detailed financial events tables.

🚧

Financial Data

For this data to be available, you need to connect your Seller Central account to DataHawk. This will unveil 2 years of historical data for the accounts connected.

Advertising - Account, campaign, and product-level data

All your costs, sales, clicks, impressions, orders, units sold on a product, campaign, and account level data. Monitor and analyze your ads performance to optimize your ads cost.

🚧

Advertising Data

For this data to be available, you need to connect your Advertiser account to DataHawk. This will unveil 3 months of historical data for the accounts connected.

Raw - Consume data exactly as it was extracted from Amazon

Raw schemas contain data straight from Amazon's own services (e.g., Seller Central Reports), with minimal transformations and/or aggregations. This data is useful for performing more complex analyses or consuming data types whose schemas and intelligence are still under development by DataHawk. For now, Raw data contains:

  • Advertising Data - Shows all campaigns run in much more granular detail. This schema is currently available only upon request.
  • Inventory Data - Shows current and historical stock per SKU, as well as re-stocking recommendations generated by Amazon.

Reports - Smart way to join data sources to answer use cases

These tables join several themes (i.e., estimated sales and sales ranks) to answer a distinct use case.

Referential - Everything related to your DataHawk account and more!

This schema contains:

  1. DataHawk workspace-related information: tracked items, tags, Seller Central & Advertising accounts connected.
  2. Repositories about marketplaces, browse node tree, and currency rate. These are useful to add extra information to other data sources, such as browse node details for sales rank data or currency rates to sales data.

Selling Partner - Seller-specific and Vendor-specific metrics all in one schema

At present, this schema contains Seller Traffic data, as well as Vendor data (Sales, Inventory, Traffic, Margins, and Forecasts). This is an in-progress schema that will soon consolidate all data pertaining to sellers and vendors. While still under construction, you may notice duplicates between this schema and the PREVIEW_RAW_SELLER and PREVIEW_RAW_VENDOR schemas.

Usage - Monitor your database credit usage

You'll have access to your remaining monthly query credits and the list of all queries made on your database.

Preview Schemas - See new data sources we're currently building

At present, there exist three Preview schemas. The PREVIEW_RAW_SELLERschema contains Seller Central data, such as Traffic & Sessions data; The PREVIEW_RAW_VENDORschema contains Vendor Central data, such as data pertaining to Sales, Inventory, and Traffic; and finally, the PREVIEW schema contains all other non-specific data currently in the works. These schemas are not permanent, and may be deprecated in the future.

Frequently asked questions

Why do you provide destinations?

As Marketplaces and Marketplace Sellers grow, so will the complexity surrounding multiple APIs provided by Marketplace operators. Our platform helps your Data team move faster by giving a normalized and documented data foundation. We handle the API rate limits, evolving schemas, tests, transformations, data back-filling, and all other tedious tasks.

Who are destinations for?

Brand managers can use destinations to send ready-to-use and customizable analytics to a Report Template. The hosted database can also be directly used by data teams from Amazon Sellers, Aggregators, or Agencies. Technical users can build their own reports and tools on any compatible destination.

Can I use the datasets for ERP or CRM integrations?

Definitely! Our customers use the hosted database for Oracle NetSuite ERP, Salesforce Service Cloud CRM, and Marketing Cloud integrations.

How do you deliver the data?

We currently use Snowflake and Google BigQuery for data sharing. Amazon Redshift, and Databricks will soon be supported.

My data science team spends too much time on data preparation. Are the datasets ready for ML model building?

Yes. Connect your Jupyter or hex.tech workbook to your data warehouse, and start building!

How to visualize and build reports on top of these datasets?

You can use any of the modern BI/Reporting tools. We’ve seen our customers using PowerBI, Google Sheets, Excel, Tableau, Apache Superset, and Metabase. Many more are available, such as Qlik, Looker, Google Data Studio, Holistics, etc.