Dmoz-tddli.rar May 2026

About Dataset. This is a URL classification dataset drawn from the DMOZ directory. It contains 15 classes.

“DMOZ — the Open Directory Project — officially closed today. It marks the end of an era of humans trying to catalog the entire web.” (Search Engine Land, 2017)

Because the archive is a .rar file, you will need a third-party tool such as WinRAR or 7-Zip to extract its contents.

The data includes deep taxonomic paths (e.g., Science/Technology/Space), which makes it well suited to testing multi-level classification algorithms.
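To use a taxonomic path for multi-level classification, one option is to expand it into one label per depth. The following is a minimal sketch, assuming the paths are slash-separated strings like the example above. The function name `path_levels` is hypothetical and not part of the dataset.

```python
def path_levels(path: str) -> list[str]:
    """Expand a slash-separated category path into one label per depth.

    Assumes categories are separated by "/" as in "Science/Technology/Space".
    """
    # Drop empty segments so leading or trailing slashes do not create blank labels.
    parts = [segment for segment in path.strip("/").split("/") if segment]
    # Build the cumulative prefix for each depth: level 1, level 1-2, level 1-3, ...
    return ["/".join(parts[: depth + 1]) for depth in range(len(parts))]


print(path_levels("Science/Technology/Space"))
# ['Science', 'Science/Technology', 'Science/Technology/Space']
```

Each element can then serve as the target for a classifier trained at that depth, with the first element corresponding to the top-level class.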

“Getting a website listed in DMOZ can be very frustrating... but being listed will probably help our Google rankings.” (WebWorkshop)

On the downside, because DMOZ officially closed in March 2017, a significant portion of the URLs in this archive may now lead to dead links or parked domains.
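If live pages are needed, it may be worth checking which URLs still respond before fetching anything. The following is a rough sketch using only the Python standard library. The function name `is_reachable` and the 5-second timeout are assumptions for illustration. A successful response does not rule out a parked domain, and some servers reject HEAD requests, so treat the result as a first-pass filter only.

```python
import http.client
import urllib.error
import urllib.request


def is_reachable(url: str, timeout: float = 5.0) -> bool:
    """Return True if the URL answers a HEAD request with a non-error status."""
    try:
        # HEAD avoids downloading the page body.
        request = urllib.request.Request(url, method="HEAD")
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status < 400
    except (urllib.error.URLError, http.client.HTTPException, ValueError, OSError):
        # Covers DNS failures, refused connections, timeouts, HTTP error
        # statuses, and strings that are not valid URLs.
        return False
```

For example, `is_reachable("http://example.com/")` makes one network request and returns a boolean. Running this over the whole archive would be slow without batching or concurrency.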

Early internet professionals often noted the directory's prestige and the difficulty of getting listed.