Check whether a (social) media post already exists and has been geolocated in several databases (e.g. Belllingcat's, Cen4InfoRes, GeoConfirmed).
Has both a simple web UI as well as an API and a command-line client.
Requires Python, pip
and the flask
package (the CLI can be used without
flask
).
For data extraction, the
osint-geo-extractor library is
used.
virtualenv .venv
.venv/bin/activate
pip install -e .
Command line
# Before first run: Download necessary files
media_search -o
# Dump data into pre-formatted database so the web API can use it.
media_search -d
# Print URLs as JSON
media_search -j
# Seach for a specific URL
media_search <url>
The Belllingcat
database is from their
Civilian Harm in Ukraine project.
There's an "Export to JSON" button.
The Cen4InfoRes
database is from the Center for Information Resilience's
Eyes on Russia map.
The GeoConfirmed
database is from the
GeoConfirmed map.
The Reukraine
database is from reukraine.shtab.net.
Note: Beta status, currently not publicly available.
The Texty
database is from
Texty.org.ua.
with kind permission from the authors.
The media_search -o
command will download the data on your behalf and put it
into a data/
folder.
There's a simple flask
application with a plain HTML+JS frontend.
The API part requires you to do a dump of the pre-formatted database beforehand.
Run media_search -d
before you start the API server.
Run the server:
$ flask --app 'media_search.web' run --port 8000
By default, the server will run on http://localhost:8000/
. It is not
recommended to run a flask
development server in production.
Use e.g. gunicorn:
$ gunicorn -w 2 media_search.web:app'
Available routes
-
/
: Web UI -
/api/v1.1/export
-GET
- Export all URLs as JSON -
/api/v1.1/query
-GET
/POST
- Check if URL is in a database
params:urls
: List of urls to search (asPOST
JSON body array of strings or comma-separatedGET
argument)format
:text
,json
orcsv
- Show results in web UI (text
) or force downloading asresults.json
orresults.csv
file
response:
{ "dataset": { "t.me/truexanewsua/57093": [ { "desc": "2022-08-15 - A naval mine exploded...", "id": "UW11823", "location": { "latitude": "46.0552", "longitude": "30.443433", "place_desc": "Ukraine - Odessa Oblast - Zatoka" }, "source": "CENINFORES", "unsanitized_url": "https://t.me/truexanewsua/57093" } ], "twitter.com/foo/bar": ["..."], "baz/bar": ["..."], }, "message": "Success! Url found in database", "success": true }
If the entry as not found,
success
isfalse
. -
/api/v1.1/query/csv
-POST
- Check all links in uploaded.csv
file
params: File asPOST
form data namedfile
curl example:
# POST
curl 'http://localhost:8000/api/v1.1/query' \
-X POST \
-H 'Content-Type: application/json' \
--data-raw '{"urls":["https://twitter.com/RALee85/status/1497853526881546241"]}'
# GET
curl 'http://localhost:8000/api/v1.1/query?\
urls=https://twitter.com/RALee85/status/1497853526881546241,\
https://twitter.com/GeoConfirmed/status/1508518239567065090'
Web UI screenshot:
$ media_search 'https://twitter.com/RedIntelPanda/status/1488569554028707847'
Found URL https://twitter.com/RedIntelPanda/status/1488569554028707847 in 'CEN4INFORES' dataset
Id: UW0067
Description:
DATE: 01/01/2022
LINK: https://twitter.com/GirkinGirkin/status/1488544876908187650
GEOLOCATION: https://twitter.com/RedIntelPanda/status/1488569554028707847
BRIEF DESCRIPTION: Russian military equipment moving in Belarus
COUNTRY: Belarus
PROVINCE:
DISTRICT:
TOWN/CITY: Petrishki
COORDINATES: 54.069385, 27.211645
ARMS/MUNITION:
VIOLENCE LEVEL: 1
ENTRY: UW0067
Copyright 2022 conflict-investigations Team. Licensed under MIT License
autosize library by Jack Moore, MIT.
Marker favicon by Leaflet project, Copyright (c) 2010-2022, Vladimir Agafonkin