Research Methodology

Last updated: April 2026

Transparency is core to our work. This page explains exactly how we collect, verify, and publish data. If our methodology has a weakness, we want you to see it — and we want to hear about it.

How We Collect Pricing Data

Distributed Collection Network

Our pricing data is collected by a network of automated research workers running across multiple server locations. Each worker uses diverse IP ranges (residential proxies across multiple geographic regions) to avoid sampling bias from any single network vantage point.

Workers query provider websites, industry pricing databases, and verified quote aggregators. Each data point records the source, collection timestamp, and geographic scope.

Historical Price Archives

We extract historical pricing data from archived web pages dating back to 2013 using the Wayback Machine and other archival services. This allows us to track how service costs have changed over more than a decade — not just report current prices.

Historical data is clearly labeled with its archive date and source URL. Confidence scores for historical data reflect the age and completeness of the archived page.

How We Analyze Consumer Sentiment

Multi-Platform Review Analysis

We collect consumer discussions and reviews from three primary platforms:

  • Reddit — Subreddit discussions where consumers share unfiltered experiences (r/homeimprovement, r/legaladvice, r/personalfinance, and industry-specific subreddits)
  • Yelp — Business reviews with star ratings and detailed experience descriptions
  • Google Maps — Local business reviews with geographic specificity

Reviews are analyzed using large language models to extract recurring themes — both positive and negative — with frequency percentages. Results are presented as "X% of reviews mention [theme]" based on the actual sample analyzed.

How We Conduct Consumer Surveys

Ongoing On-Page Surveys

Visitors to our network of sites are invited to participate in brief, anonymous surveys about their service experiences and expectations. Surveys cover:

  • Satisfaction ratings (emoji scale)
  • Specific issues encountered
  • Approximate amount paid
  • Industry standards they believe should be mandatory
  • Willingness to pay premiums for specific guarantees

Participation is voluntary. Responses are anonymous — we cannot connect a survey response to any individual. Results are aggregated by industry and city.

Confidence Scoring

Every price data point includes a confidence score from 0-100% based on:

Data with confidence below 60% is flagged for additional verification before publication.

Audit Trail

Every data point in our database maintains a complete audit trail:

This ensures complete transparency and verifiability. If a price changed, you can see exactly when and why.

Quality Gates for Published Research

Before any research article is published on our network, it must pass four independent quality checks:

Articles that fail any gate are rejected and logged. The rejection rate is typically 60-70% — most generated content doesn't meet our standards.

Limitations

We believe in stating our limitations openly:

We continuously work to address these limitations. If you spot an error or have methodological feedback, please contact research@price-quotes.com.

Data Licensing

All published research and data is licensed under CC-BY-4.0. You may freely cite, share, and build upon our work with attribution to "Price-Quotes Research Lab" and a link to the source page.