Research Methodology

BXTData Methodology

How consumer goods research data is collected, cleaned, and analyzed

BXTData methodology covers data collection, cleaning, SKU identification, review analysis, deduplication logic, AI classification, and update frequency to keep consumer goods research explainable and repeatable.
01

Data Collection

Collect ecommerce, O2O, instant retail, product page, review, social, store, and public market data across brand, SKU, price, and promotion dimensions.

  • Platform pages and product links
  • Price, promotion, and stock status
  • Reviews, ratings, and text
  • City, store, and trade area data
02

Cleaning and Deduplication

Standardize duplicate links, invalid products, pricing anomalies, and cross-platform field differences to create a comparable data layer.

  • Duplicate removal
  • Outlier detection
  • Unit normalization
  • Field standardization
03

SKU Identification

Map the same item across platforms so price, review, sales, and channel signals can be analyzed on a unified SKU basis.

  • Brand and size matching
  • Packaging normalization
  • Title parsing
  • SKU validation
04

AI Analysis

Use NLP, OCR, classification models, and anomaly detection to turn reviews, screenshots, promotions, and channel changes into structured business signals.

  • Sentiment analysis
  • OCR recognition
  • Topic clustering
  • Alert models
05

Outputs

Deliver research reports, alert dashboards, channel scores, trend analysis, and case reviews for market, ecommerce, channel, and product teams.

  • Reports
  • KPI dashboards
  • Alerts
  • Trend analysis

Method Notes

This methodology supports ecommerce, O2O, instant retail, review analytics, price governance, channel execution, and product innovation. The goal is to make each output traceable to source, rules, and analysis logic.

Methodology FAQ

How does BXTData identify SKUs?

By combining brand, size, packaging, flavor, barcode, and title signals with rule-based and model-based validation.

How are reviews analyzed?

Text is cleaned, deduplicated, and filtered before topic detection, sentiment analysis, and negative issue attribution.

How often is data updated?

Different datasets update daily, weekly, or near real time depending on the source and business scenario.

Related Pages

  • Dataset capability: /dataset-capability
  • Entity page: /what-is-bxtdata
  • Email: marketing@bxtdata.com