DNS-GFWlist

A project aimed at building a GFWlist of DNS pollution by exploring polluted DNS data.

Project Description

The inception of this project was solely for the purpose of a "Computer Network Course" experiment. The creator of this project strictly adheres to the stipulations outlined in the "Computer Information Network International Internet Security Protection Management Measures."
As a result of employing less sophisticated request methods, long-term stable querying isn't guaranteed. However, the project was functioning normally as of December 11, 2020.
The testing environment was the Nanjing University campus network. Upon testing, it was found that the level of DNS pollution and DNS attacks on the campus were relatively mild, with the results far less than the records in GFWlist.
The PowerPoint slides for the class sharing are also stored in the root directory of this project.
The focus of this project is purely technical. It does not involve personal emotions or seek to infer the author's position.

How to Use

Install the dependencies:

pip install -r requirements.txt

Run main.py

Technical Implementation

Motivation:

Upon continuous attempts to request www.google.com, it was observed that the IP addresses returned by DNS are few and the same.
If the incorrect IP addresses returned by DNS are queried in reverse, other polluted websites can be found. (This only applies to DNS reverse queries within China, overseas DNS reverse query APIs are not effective.)

Search Method:

First, a set of initial websites is specified (selecting those that are heavily attacked by DNS), and a dictionary is initialized with a large weight.

Several iterations are performed:

Select the website with the current highest weight from the dictionary.
Conduct a DNS request for the website and perform a reverse query.
If the request is indeed attacked, the weights of all websites found in the reverse query in the dictionary are increased by one.

Return the updated dictionary in descending order.

Python Packages Used

requests: Constructs requests for IP reverse queries on IP138.
beautifulsoup4: Processes the requested data.
dnspython: Performs DNS requests for websites.
python_Levenshtein: Calculates the edit distance between the domain name string of the request and the domain name list obtained through the IP reverse query to filter out unpolluted DNS query results.

Future Possibilities

Classify the obtained websites (perhaps a new way to acquire the latest adult content domain names? But this may not carry much significance).
Analyze the DNS pollution situation across different regions and service providers.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
lib		lib
.gitignore		.gitignore
DNS污染.pdf		DNS污染.pdf
README-ZH.md		README-ZH.md
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DNS-GFWlist

Project Description

How to Use

Technical Implementation

Motivation:

Search Method:

Python Packages Used

Future Possibilities

About

Releases

Packages

Languages

Submergence2000/DNS-GFWlist

Folders and files

Latest commit

History

Repository files navigation

DNS-GFWlist

Project Description

How to Use

Technical Implementation

Motivation:

Search Method:

Python Packages Used

Future Possibilities

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages