- dealShortUrl.py
reverse the most common short urls to the original urls
- shortUrl.py
reverse the rest short urls to the original urls
- getDomain.py
web scraping to get domain
- getDNSInfo.py
web scraping to get dns info
- getWhoisInfo.py
web scraping to get whois info
- simHash.py
compute the simHash of urls
- textDistance.py
calculate the text distances of urls based on the simHash
- shang.py
calculate the entropy of texts
- countKey.py
count the numbers of the keys of urls
- trace1.py
- ultis.py
- trace2-1-step.py
- trace2-2-step.py
- trace2-3-step.py
- trace2-4-step.py
- trace2-5-step.py