Kind of a weird thing to track. The numbers just seem so large. 80k new gambling sites? How many of all the blogs are just 'blogspam'? I guess that doesn't matter, right?
What defines a 'launch'? New domain registrations? Content being deployed?
I have a bunch of heuristics, but broadly speaking: a new domain plus a new, good-looking site, i.e. some legit pages, legit socials, and some real owner/purpose behind it.
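To make that concrete, here is a rough sketch of how such a rule could be written down. This is not the actual pipeline: the `SiteSnapshot` fields, the 90-day window, and the 3-page minimum are all made-up placeholders for illustration.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class SiteSnapshot:
    """Hypothetical per-site signals; field names are illustrative only."""
    domain: str
    registered_on: date
    legit_page_count: int        # pages that look like real content
    has_social_links: bool       # links to real-looking social profiles
    has_identifiable_owner: bool  # some real owner/purpose behind the site


def looks_like_launch(
    site: SiteSnapshot,
    today: date,
    max_domain_age_days: int = 90,  # assumed threshold, not from the dataset
    min_pages: int = 3,             # assumed threshold, not from the dataset
) -> bool:
    """Return True when the domain is new AND the site looks legitimate."""
    is_new_domain = (today - site.registered_on) <= timedelta(days=max_domain_age_days)
    looks_real = (
        site.legit_page_count >= min_pages
        and site.has_social_links
        and site.has_identifiable_owner
    )
    return is_new_domain and looks_real


if __name__ == "__main__":
    today = date(2025, 1, 31)
    fresh = SiteSnapshot("example-shop.store", date(2025, 1, 10), 8, True, True)
    parked = SiteSnapshot("example-parked.com", date(2025, 1, 10), 0, False, False)
    print(looks_like_launch(fresh, today))
    print(looks_like_launch(parked, today))
```

A newly registered but parked domain fails the second half of the check, which is the point of requiring both conditions rather than counting registrations alone.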
Some findings from the dataset:
Geography (top countries):
United States: 253,589
India: 34,127
Canada: 20,263
United Kingdom: 18,701
Pakistan: 10,124
(392 total countries)
Industries:
E-commerce: 164,010
Adult & Gambling: large category overall
Gambling (L2): 84,353
News & Blogs: 49,424
SaaS products: 39,105
Niches (L3):
Online Casinos: 81,608
Clothing: 33,553
Niche SaaS tools: 31,881
Home Decor: 31,273
News & Journalism: 29,750
Platforms (detected on ~295k sites):
WordPress: 116,250
Shopify: 84,407
WooCommerce: 42,615
Squarespace: 25,328
Wix: 23,598
Webflow: 3,049
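For a sense of proportion, the platform counts above can be turned into shares. This is just arithmetic on the listed numbers; it assumes each site is counted under exactly one platform, which is not stated here (WooCommerce runs on top of WordPress, so overlap is possible).

```python
# Platform counts as listed above.
platform_counts = {
    "WordPress": 116_250,
    "Shopify": 84_407,
    "WooCommerce": 42_615,
    "Squarespace": 25_328,
    "Wix": 23_598,
    "Webflow": 3_049,
}

# Assumption: counts are mutually exclusive, so their sum is the denominator.
total_detected = sum(platform_counts.values())

# Share of each platform among sites with a detected platform, in percent.
platform_shares = {
    name: round(100 * count / total_detected, 1)
    for name, count in platform_counts.items()
}

if __name__ == "__main__":
    print(f"total detected: {total_detected:,}")
    for name, pct in platform_shares.items():
        print(f"{name}: {pct}%")
```

The six counts sum to 295,247, which lines up with the "~295k" figure in the header.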
TLDs:
.com: 435,622
.store: 38,223
.org: 26,474
.online: 23,422
.site: 22,919
.ai: 6,167
Happy to answer questions about methodology, accuracy, crawling, classification, or detection heuristics.