How DataHawk Sources Your Data
Understand the difference between public and private data in DataHawk, why some data requires account connections, and typical freshness for each data type.
DataHawk pulls data from two fundamentally different places: Amazon and Walmart's public storefront, and your private seller or vendor accounts. Understanding this distinction helps you know what data you have access to, how fresh it is, and why some data requires account connections while other data is available immediately.
This page applies to both Amazon and Walmart unless otherwise noted.
Two types of data
Data with no account connection needed
DataHawk collects data across millions of products, such as:
- Product information: Titles, descriptions, images, ratings, review counts
- Pricing: Listed price, buy box price, competitor pricing
- Best Seller Rank (BSR): Category rankings
- Sales estimates: Estimated units sold per ASIN (DataHawk's own model, based on public signals)
- Keyword search volumes: Estimated monthly search frequency per keyword
- Organic and sponsored search rankings: Where a product appears for a given keyword
- Category structures: Amazon's browse node tree
You can access all of this for your own products and your competitors' products.
Data that requires account connection
Other data only exists inside your seller or vendor account; those data can only be accessed with your explicit authorisation via Amazon's or Walmart's API.
Private data includes:
- Orders and sales: Actual units sold, revenue, returns (from your account)
- Inventory levels: Stock on hand, inbound shipments, FBA storage
- Advertising data: Spend, impressions, clicks, ACoS, campaigns
- Profit and costs: Fees, COGS, net margin
- Vendor-specific data: Purchase orders, shipped cost of goods, vendor metrics
You need to connect your Seller Central, Vendor Central, or Advertising account to DataHawk. See the Set up data section for step-by-step instructions.
Why some data has a delay
Even after connecting your account, not all data appears instantly. The delay depends on how Amazon or Walmart makes data available through their APIs.
| Data type | Typical freshness | Reason |
|---|---|---|
| Product listings, pricing, BSR | Daily | DataHawk collects public page data daily |
| Keyword rankings | Daily | Collected per keyword, per marketplace |
| Sales estimates | Daily | Model runs on daily collected data |
| Orders (Seller) | Daily (D+1 to D+3) | Amazon SP-API has a settlement delay |
| Inventory | Daily | API snapshot |
| Advertising | Daily (D+2) | Amazon Ads API standard delay |
| Vendor metrics | Weekly | Vendor Central API update cadence |
"D+2" means data for a given day becomes available 2 days later. This is an Amazon-side constraint. DataHawk fetches data as soon as Amazon makes it available.
What happens if you don't connect an account
You can still use DataHawk without connecting any account. You'll have access to all public data. This is useful for product research, competitor monitoring, and keyword analysis.
However, without an account connection, you will not see:
- Your own sales or revenue figures
- Inventory alerts
- Advertising performance
- Profit and margin data
- Returns and removal data
DataHawk dashboards that display private data will show empty or placeholder values until the relevant account is connected.
Data collection across marketplaces
If you sell in multiple countries (e.g., US, UK, Germany), DataHawk collects data per marketplace independently. Connecting your US Seller account does not automatically pull UK data. Each marketplace requires its own configuration.
Technical reference
Need schema names, source APIs, account requirements, or freshness fields?