Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.
-
Updated
Apr 9, 2024 - Java
Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.
Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum
Models, and associated helper code for GSOC 2017 project Tensorflow Image to Text in Apache Tika
Domain Discovery for the Sparkler Crawl Environment
This repository contains the code for a research project that implements and evaluates local word embeddings based on co-authorship and citations for query expansion in PyTerrier on the TREC-Covid dataset.
Add a description, image, and links to the irds topic page so that developers can more easily learn about it.
To associate your repository with the irds topic, visit your repo's landing page and select "manage topics."