Skip to content

Commit

Permalink
Fixed the location of Better NLP in the NLP section
Browse files Browse the repository at this point in the history
  • Loading branch information
neomatrix369 committed Apr 14, 2019
1 parent 5c447e9 commit a1756b1
Show file tree
Hide file tree
Showing 2 changed files with 9 additions and 61 deletions.
68 changes: 8 additions & 60 deletions examples/better-nlp/better-nlp-spacy-texacy-examples.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Setup and installation\n",
"#### Setup and installation ( optional )\n",
"\n",
"In case, this notebook is running in a local environment (Linux/MacOS) which does not have the necessary dependencies installed the next section would need to be executed.\n",
"\n",
Expand Down Expand Up @@ -72,7 +72,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Install Spacy model\n",
"#### Install Spacy model ( NOT optional )\n",
"\n",
"Install the large English language model for spaCy - will be needed for the examples in this notebooks.\n",
"\n",
Expand Down Expand Up @@ -100,7 +100,7 @@
},
{
"cell_type": "code",
"execution_count": 1,
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -109,7 +109,7 @@
},
{
"cell_type": "code",
"execution_count": 2,
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
Expand All @@ -126,21 +126,9 @@
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Loading model 'en'...\n",
"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n",
"model_loading_time_in_secs= 16.76767325401306\n",
"model_loading_method= directly, first time\n",
"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n"
]
}
],
"outputs": [],
"source": [
"model_loading_result = betterNLP.load_nlp_model()\n",
"print(\"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\")\n",
Expand All @@ -153,49 +141,9 @@
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": null,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n",
"extract_entities_processing_time_in_secs= 0.05238509178161621\n",
"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n",
"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n",
"Denis Guedj (PERSON)\n",
"1940 – April 24, 2010 (DATE)\n",
"French (NORP)\n",
"the History of Science (ORG)\n",
"Paris VIII University (ORG)\n",
"Setif (NORP)\n",
"many years (DATE)\n",
"The Parrot's Theorem (WORK_OF_ART)\n",
"Paris (GPE)\n",
"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n",
" Token entity types\n",
"0 PERSON = People, including fictional\n",
"1 NORP = Nationalities or religious or political...\n",
"2 FAC = Buildings, airports, highways, bridges, etc\n",
"3 ORG = Companies, agencies, institutions, etc\n",
"4 GPE = Countries, cities, states\n",
"5 LOC = Non-GPE locations, mountain ranges, bodi...\n",
"6 PRODUCT = Objects, vehicles, foods, etc. (Not ...\n",
"7 EVENT = Named hurricanes, battles, wars, sport...\n",
"8 WORK_OF_ART = Titles of books, songs, etc\n",
"9 LAW = Named documents made into laws\n",
"10 LANGUAGE = Any named language\n",
"11 DATE = Absolute or relative dates or periods\n",
"12 TIME = Times smaller than a day\n",
"13 PERCENT = Percentage, including ”%“\n",
"14 MONEY = Monetary values, including unit\n",
"15 QUANTITY = Measurements, as of weight or distance\n",
"16 ORDINAL = “first”, “second”, etc\n",
"17 CARDINAL = Numerals that do not fall under ano...\n"
]
}
],
"outputs": [],
"source": [
"parsed_generic_text = betterNLP.extract_entities(model, generic_text)\n",
"print(\"~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\")\n",
Expand Down
2 changes: 1 addition & 1 deletion natural-language-processing/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@
## Library, Framework, Models, Tools

- [BloomburyAI's Open Source NLP tool: Cape Webservices - backend server](https://github.com/bloomsburyai/cape-webservices) | [Rest of BloomburyAI's Open Source NLP tool - Cape](https://www.github.com/bloomsburyai) [Bought out by FB around March/April 2019]
- [Better NLP library (experimental)](./examples/better-nlp/)
- [Better NLP library (experimental)](../examples/better-nlp)
- [Facebook's PyText](https://github.com/facebookresearch/PyText)
- [Facebook's FastText](https://github.com/facebookresearch/FastText) | [homepage | docs](https://fasttext.cc/)
- [Smile - Statistical Machine Intelligence and Learning Engine](https://haifengl.github.io/smile)
Expand Down

0 comments on commit a1756b1

Please sign in to comment.