Commit
adding directory with some of the WANE explorations for gov info collection
Showing 12 changed files with 3,469,152 additions and 0 deletions.
407 changes: 407 additions & 0 deletions
gov-info/.ipynb_checkpoints/day3-adding-polish-checkpoint.ipynb
3,643 changes: 3,643 additions & 0 deletions
gov-info/.ipynb_checkpoints/gephi-export-trial-checkpoint.ipynb
840 changes: 840 additions & 0 deletions
gov-info/.ipynb_checkpoints/wane-bipartite-graphs-checkpoint.ipynb
@@ -0,0 +1,9 @@
# Working with the Gov-Info files

This directory contains IPython notebooks, WANE files, and GML files that were used to work with a partial set of Internet Archive data for the parl.gc.ca domain.

The WANE files are a derived dataset containing extracted named entities and URLs.

The Python notebooks were developed to extract the named entities and URLs from the JSON in the WANE files and transform them into a graph using the networkx Python module.

The graphs were then saved in the GML file format for import into Gephi, where they can be visualized and used for exploratory data analysis.
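
The workflow described above (WANE JSON to networkx graph to GML) could look roughly like the sketch below. It is not the code from the notebooks. It assumes each WANE record is one JSON object per line with a `url` field and a `named_entities` object keyed by entity type; the sample records, the `build_graph` helper, the `kind` node attribute, and the output file name are all hypothetical.

```python
import json
import tempfile
from pathlib import Path

import networkx as nx

# Hypothetical sample records. The field names ("url", "named_entities",
# "persons", ...) are assumptions about the WANE layout, not taken from
# the files in this directory.
SAMPLE_WANE_LINES = [
    '{"url": "http://www.parl.gc.ca/a", "named_entities": '
    '{"persons": ["Jane Doe"], "organizations": ["House of Commons"], '
    '"locations": ["Ottawa"]}}',
    '{"url": "http://www.parl.gc.ca/b", "named_entities": '
    '{"persons": ["Jane Doe"], "organizations": [], "locations": ["Ottawa"]}}',
]


def build_graph(lines):
    """Link each URL node to the named entities extracted from that page."""
    graph = nx.Graph()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        url = record["url"]
        graph.add_node(url, kind="url")
        for entity_type, names in record.get("named_entities", {}).items():
            for name in names:
                # Entities shared by several pages collapse into one node.
                graph.add_node(name, kind=entity_type)
                graph.add_edge(url, name)
    return graph


graph = build_graph(SAMPLE_WANE_LINES)

# Save as GML so the graph can be opened in Gephi.
out_path = Path(tempfile.gettempdir()) / "wane-sample.gml"
nx.write_gml(graph, str(out_path))
```

Because every edge joins a URL to an entity, the result is a two-mode (bipartite) graph; the `kind` attribute is written into the GML file and can be used in Gephi to color or filter nodes by type.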