|
Data mining meta-directory: general resources with emphasis on
fraud detection, CRM, advertising technology, web mining, probabilistic trading,
scorecards, risk management, market research, business intelligence, artificial intelligence,
statistical technology.
Data Sets
- Using Multivariate Statistics by Barbara Tabachnik and Linda Fidell - Data sets from the book
- Server logs data - 150,000 visitors over a 12-month time period. Analyse
performance of various advertising campaigns, create click fraud detection rules. One week sample.
- Search Domain Names - For instance, find all
web domains containing the keyword analytic. Also provide web rankings.
- UCI KDD Archive - Datasets used for research in machine learning and knowledge discovery
- WorldNet - Lexical database for the English language, useful for search engine and keyword intelligence
- IP Lists - List of web robots and other bad IP addresses, useful for fraud detection
- Anonymous Proxy Servers - List of suspicious IP addresses, useful for fraud detection
- Digital Envoy - IP intelligence: country, connection speed, domain name, zipcode
- Third party research - competitive intelligence, market research data
- Zip Code Data - with state, area, X and Y coordinates, to produce graphs like this
- Grain Market Research - Historical Futures, Stock and Tick Data
- Yahoo Finance - Stock market historical data, daily granularity
|
|