Integrate Elasticsearch with WordPress.
Please note: the master branch is the stable
Upgrade Notice: Versions 1.6.1, 1.6.2, 1.7, and 1.8 require re-indexing.
Let's face it, WordPress search is rudimentary at best. Poor performance, inflexible and rigid matching algorithms (which means no comprehension of 'close' queries), the inability to search metadata and taxonomy information, no way to determine categories of your results and most importantly the overall relevancy of results is poor.
Elasticsearch is a search server based on Lucene. It provides a distributed, multitenant-capable full-text search engine with a RESTful web interface and schema-free JSON documents.
Coupling WordPress with Elasticsearch allows us to do amazing things with search including:
- Relevant results
- Autosuggest
- Fuzzy matching (catch misspellings as well as 'close' queries)
- Proximity and geographic queries
- Search metadata
- Search taxonomies
- Facets
- Search all sites on a multisite install
- The list goes on...
The goal of ElasticPress is to integrate WordPress with Elasticsearch. This plugin integrates with the WP_Query object returning results from Elasticsearch instead of MySQL.
There are other Elasticsearch integration plugins available for WordPress. ElasticPress, unlike others, offers multi-site search. Elasticsearch is a complex topic and integration results in complex problems. Rather than providing a limited, clunky UI, we elected to instead provide full control via WP-CLI.
- Elasticsearch 1.3+
- WordPress 3.7.1+
- WP-CLI 0.13+, if using WP-CLI for indexing
- First, you will need to properly install and configure Elasticsearch.
- ElasticPress requires WP-CLI, if you want to use WP-CLI for indexing. Install it by following these instructions.
- Install the plugin in WordPress. You can download a zip via Github and upload it using the WP plugin uploader.
First, make sure you have Elasticsearch and WP-CLI configured properly.
- Define the constant
EP_HOST
in yourwp-config.php
file with the connection (and port) of your Elasticsearch application. For example:
define( 'EP_HOST', 'http://192.168.50.4:9200' );
Note: The URL for EP_HOST
must begin with a protocol specifier (http
or https
). URLs without a protocol prefix will not be parsed correctly and will cause ElasticPress to error out.
The proceeding sets depend on whether you are configuring for single site or multi-site with cross-site search capabilities.
- Activate the plugin.
- Using WP-CLI, do an initial sync (with mapping) with your ES server by running the following commands:
wp elasticpress index --setup
- Network activate the plugin
- Using WP-CLI, do an initial sync (with mapping) with your ES server by running the following commands:
wp elasticpress index --setup --network-wide
First, make sure you have Elasticsearch configured properly.
- Activate the plugin on the single site you want to index.
- Go to the settings page, found at Settings > ElasticPress.
- Set the Elasticsearch host in the proper input, with the connection (and port) of your Elasticsearch application. For example:
http://192.168.50.4:9200
Note: The URL for the Elasticsearch host must begin with a protocol specifier (http
or https
). URLs without a protocol prefix will not be parsed correctly and will cause ElasticPress to error out.
- Network activate the plugin
- Go to the settings page, found at Settings > ElasticPress, in the Network Admin.
- Set the Elasticsearch host in the proper input, with the connection (and port) of your Elasticsearch application. For example:
http://192.168.50.4:9200
Note: The URL for the Elasticsearch host must begin with a protocol specifier (http
or https
). URLs without a protocol prefix will not be parsed correctly and will cause ElasticPress to error out.
- Once a proper host is set, you can now click the Run Index button to start the indexing process.
- Once indexing is done, refresh this page to view the status and some stats.
After your index finishes, WP_Query
will be integrated with Elasticsearch and support a few special parameters.
Creating indices is handled automatically by ElasticPress. Index names are automatically generated based on site URL.
After running an index, ElasticPress integrates with WP_Query
if and only if the query is a search or the ep_integrate
parameter is passed (see below). The end goal is to support all the parameters available to WP_Query so the transition is completely transparent. Right now, our WP_Query integration supports many of the relevant WP_Query parameters and adds a couple special ones.
-
s
(string)Search keyword. By default used to search against
post_title
,post_content
, andpost_excerpt
. -
posts_per_page
(int)Number of posts to show per page. Use -1 to show all posts (the
offset
parameter is ignored with a -1 value). Set thepaged
parameter to paginate based onposts_per_page
. -
tax_query
(array)Filter posts by terms in taxonomies. Takes an array of form:
new WP_Query( array( 's' => 'search phrase', 'tax_query' => array( array( 'taxonomy' => 'taxonomy-name', 'field' => 'slug', 'terms' => array( 'term-slug-1', 'term-slug-2', ... ), ), ), ) );
tax_query
accepts an array of arrays where each inner array only supportstaxonomy
(string),field
(string), andterms
(string|array) parameters.field
must be set toslug
,name
, orterm_id
. The default value forfield
isterm_id
.terms
must be a string or an array of term slug(s), name(s), or id(s). -
The following shorthand parameters can be used for querying posts by specific dates:
year
(int) - 4 digit year (e.g. 2011).month
ormonthnum
(int) - Month number (from 1 to 12).week
(int) - Week of the year (from 0 to 53).day
(int) - Day of the month (from 1 to 31).dayofyear
(int) - Day of the month (from 1 to 365 or 366 for leap year).hour
(int) - Hour (from 0 to 23).minute
(int) - Minute (from 0 to 59).second
(int) - Second (0 to 59).dayofweek
(int|array) - Weekday number, when week starts at Sunday (1 to 7).dayofweek_iso
(int|array) - Weekday number, when week starts at Monday (1 to 7).
This is a simple example which will return posts which are created on January 1st of 2012 from all sites:
new WP_Query( array( 's' => 'search phrase', 'sites' => 'all', 'year' => 2012, 'monthnum' => 1, 'day' => 1, ) );
-
date_query
(array)date_query
accepts an array of keys and values (array|string|int) to find posts created on specific dates/times as well as an array of arrays with keys and values (array|string|int|boolean) containing the following parametersafter
,before
,inclusive
,compare
,column
, andrelation
.column
is used to query specific columns from thewp_posts
table. This will return posts which are created after January 1st 2012 and January 3rd 2012 8AM GMT:new WP_Query( array( 's' => 'search phrase', 'date_query' => array( array( 'column' => 'post_date', 'after' => 'January 1st 2012', ), array( 'column' => 'post_date_gmt', 'after' => 'January 3rd 2012 8AM', ), ), ) );
Currently only the
AND
value is supported for therelation
parameter.inclusive
is used on after/before options to determine whether exact value should be matched or not. If inclusive is used and you pass in sting without specific time, it will be converted to 00:00:00 on that date. In this case, even if inclusive was set to true, the date would not be included in the query. If you want to include that specific date, you need to pass the time as well. (e.g. 'before' => '2012-01-03 23:59:59')The example will return all posts which are created on January 5th 2012 after 10:00PM and 11:00PM inclusively, because the time is specified:
new WP_Query( array( 's' => 'search phrase', 'date_query' => array( array( 'column' => 'post_date', 'before' => 'January 5th 2012 11:00PM', ), array( 'column' => 'post_date', 'after' => 'January 5th 2012 10:00PM', ), 'inclusive' => true, ), ) );
compare
supports the following options:=
- Posts will be returned that are created on a specified date.!=
- Posts will be returned that are not created on a specified date.>
- Posts will be returned that are created after a specified date.>=
- Posts will be returned that are created on a specified date or after.<
- Posts will be returned that are created before a specified date.<=
- Posts will be returned that are created on a specified date or before that.BETWEEN
- Posts will be returned that are created between a specified range.NOT BETWEEN
- Posts will be returned that are created not in a specified range.IN
- Posts will be returned that are created on any of the specified dates.NOT IN
- Posts will be returned that are not created on any of the specified dates.
compare
can be combined with shorthand parameters as well as withafter
andbefore
. This example will return all posts which are created during Monday to Friday, between 9AM to 5PM:new WP_Query( array( 's' => 'search phrase', 'date_query' => array( array( 'hour' => 9, 'compare' => '>=', ), array( 'hour' => 17, 'compare' => '<=', ), array( 'dayofweek' => array( 2, 6 ), 'compare' => 'BETWEEN', ), ), ) );
-
meta_query
(array)Filter posts by post meta conditions. Meta arrays and objects are serialized due to limitations of Elasticsearch. Takes an array of form:
new WP_Query( array( 's' => 'search phrase', 'meta_query' => array( array( 'key' => 'key_name', 'value' => 'meta value', 'compare' => '=', ), ), ) );
meta_query
accepts an array of arrays where each inner array only supportskey
(string),type
(string),value
(string|array|int), andcompare
(string) parameters.compare
supports the following:* ```=``` - Posts will be returned that have a post meta key corresponding to ```key``` and a value that equals the value passed to ```value```. * ```!=``` - Posts will be returned that have a post meta key corresponding to ```key``` and a value that does NOT equal the value passed to ```value```. * ```>``` - Posts will be returned that have a post meta key corresponding to ```key``` and a value that is greater than the value passed to ```value```. * ```>=``` - Posts will be returned that have a post meta key corresponding to ```key``` and a value that is greater than or equal to the value passed to ```value```. * ```<``` - Posts will be returned that have a post meta key corresponding to ```key``` and a value that is less than the value passed to ```value```. * ```<=``` - Posts will be returned that have a post meta key corresponding to ```key``` and a value that is less than or equal to the value passed to ```value```. * ```EXISTS``` - Posts will be returned that have a post meta key corresponding to ```key```. * ```NOT EXISTS``` - Posts will be returned that do not have a post meta key corresponding to ```key```.
The outer array also supports a
relation
(string) parameter. By defaultrelation
is set toAND
:new WP_Query( array( 's' => 'search phrase', 'meta_query' => array( array( 'key' => 'key_name', 'value' => 'meta value', 'compare' => '=', ), array( 'key' => 'key_name2', 'value' => 'meta value', 'compare' => '!=', ), 'relation' => 'AND', ), ) );
Possible values for
relation
areOR
andAND
. Ifrelation
is set toAND
, all inner queries must be true for a post to be returned. Ifrelation
is set toOR
, only one of the inner meta queries must be true for the post to be returned.type
supports the following values: 'NUMERIC', 'BINARY', 'CHAR', 'DATE', 'DATETIME', 'DECIMAL', 'SIGNED', 'TIME', and 'UNSIGNED'. By default WordPress casts meta values to these types in MySQL so some of these don't make sense in the context of Elasticsearch. ElasticPress does no "runtime" casting but instead compares the value to a different type compiled during indexingNUMERIC
- Compares queryvalue
to integer version of stored meta value.SIGNED
- Compares queryvalue
to integer version of stored meta value.UNSIGNED
- Compares queryvalue
to integer version of stored meta value.BINARY
- Compares queryvalue
to raw, unanalyzed version of stored meta value. For actual attachment searches, check out this.CHAR
- Compares queryvalue
to raw, unanalyzed version of stored meta value.DECIMAL
- Compares queryvalue
to float version of stored meta value.DATE
- Compares queryvalue
to date version of stored meta value. Queryvalue
must be formatted like2015-11-14
DATETIME
- Compares queryvalue
to date/time version of stored meta value. Queryvalue
must be formatted like2012-01-02 05:00:00
oryyyy:mm:dd hh:mm:ss
.TIME
- Compares queryvalue
to time version of stored meta value. Queryvalue
must be formatted like17:00:00
orhh:mm:ss
.
If no type is specified, ElasticPress will just deduce the type from the comparator used.
type
is very rarely needed to be used. -
post_type
(string/array)Filter posts by post type.
any
will search all public post types.WP_Query
defaults to eitherpost
orany
if nopost_type
is provided depending on the context of the query. This is confusing. ElasticPress will ALWAYS default toany
if nopost_type
is provided. If you want to search forpost
posts, you MUST specifypost
as thepost_type
. -
post__in
(array)Specify post IDs to retrieve.
-
post__not_in
(array)Specify post IDs to exclude.
-
offset
(int)Number of posts to skip in ascending order.
-
paged
(int)Page number of posts to be used with
posts_per_page
. -
author
(int)Show posts associated with certain author ID.
-
author_name
(string)Show posts associated with certain author. Use
user_nicename
(NOT name). -
orderby
(string)Order results by field name instead of relevance. Supports:
title
,name
,date
, andrelevance
; anything else will be interpretted as a document path i.e.meta.my_key.long
ormeta.my_key.raw
. You can sort by multiple fields as well i.e.title meta.my_key.raw
-
order
(string)Which direction to order results in. Accepts
ASC
andDESC
. Default isDESC
. -
post_parent
(int)Show posts that have the specified post parent.
The following are special parameters that are only supported by ElasticPress.
-
search_fields
(array)If not specified, defaults to
array( 'post_title', 'post_excerpt', 'post_content' )
.-
post_title
(string)Applies current search to post titles.
-
post_content
(string)Applies current search to post content.
-
post_excerpt
(string)Applies current search to post excerpts.
-
taxonomies
(string => array/string)Applies the current search to terms within a taxonomy or taxonomies. The following will fuzzy search across
post_title
,post_excerpt
,post_content
, and terms within taxonomiescategory
andpost_tag
:new WP_Query( array( 's' => 'term search phrase', 'search_fields' => array( 'post_title', 'post_content', 'post_excerpt', 'taxonomies' => array( 'category', 'post_tag' ), ), ) );
-
meta
(string => array/string)Applies the current search to post meta. The following will fuzzy search across
post_title
,post_excerpt
,post_content
, and post meta keysmeta_key_1
andmeta_key_2
:new WP_Query( array( 's' => 'meta search phrase', 'search_fields' => array( 'post_title', 'post_content', 'post_excerpt', 'meta' => array( 'meta_key_1', 'meta_key_2' ), ), ) );
-
author_name
(string)Applies the current search to author login names. The following will fuzzy search across
post_title
,post_excerpt
,post_content
and authoruser_login
:new WP_Query( array( 's' => 'username', 'search_fields' => array( 'post_title', 'post_content', 'post_excerpt', 'author_name', ), ) );
-
-
aggs
(array)Add aggregation results to your search result. For example:
new WP_Query( array( 's' => 'search phrase', 'aggs' => array( 'name' => 'name-of-aggregation', // (can be whatever you'd like) 'use-filter' => true // (*bool*) used if you'd like to apply the other filters (i.e. post type, tax_query, author), to the aggregation 'aggs' => array( 'name' => 'name-of-sub-aggregation', 'terms' => array( 'field' => 'terms.name-of-taxonomy.name-of-term', ), ), ), ) );
-
cache_results
(boolean)This is a built-in WordPress parameter that caches retrieved posts for later use. It also forces meta and terms to be pulled and cached for each cached post. It is extremely important to understand when you use this parameter with ElasticPress that terms and meta will be pulled from MySQL not Elasticsearch during caching. For this reason,
cache_results
defaults to false. -
sites
(int/string/array)This parameter only applies in a multi-site environment. It lets you search for posts on specific sites or across the network.
By default,
sites
defaults tocurrent
which searches the current site on the network:new WP_Query( array( 's' => 'search phrase', 'sites' => 'current', ) );
You can search on all sites across the network:
new WP_Query( array( 's' => 'search phrase', 'sites' => 'all', ) );
You can also specify specific sites by id on the network:
new WP_Query( array( 's' => 'search phrase', 'sites' => 3, ) );
You can even specify a group of specific sites on the network:
new WP_Query( array( 's' => 'search phrase', 'sites' => array( 2, 3 ), ) );
Note: Nesting cross-site
WP_Query
loops can result in unexpected behavior. -
ep_integrate
(bool)Allows you to perform queries without passing a search parameter. This is pretty powerful as you can leverage Elasticsearch to retrieve queries that are too complex for MySQL (such as a 5-dimensional taxonomy query). For example:
Get 20 of the latest posts
new WP_Query( array( 'ep_integrate' => true, 'post_type' => 'post', 'posts_per_page' => 20, ) );
Get all posts with a specific category slug
new WP_Query( array( 'ep_integrate' => true, 'post_type' => 'post', 'posts_per_page' => -1, 'tax_query' => array( array( 'taxonomy' => 'category', 'terms' => array( 'term-slug' ), 'field' => 'slug', ), ), ) );
The following commands are supported by ElasticPress:
-
wp elasticpress index [--setup] [--network-wide] [--posts-per-page] [--nobulk] [--offset] [--show-bulk-errors] [--post-type] [--keep-active]
Index all posts in the current blog.
--network-wide
will force indexing on all the blogs in the network.--network-wide
takes an optional argument to limit the number of blogs to be indexed across where 0 is no limit. For example,--network-wide=5
would limit indexing to only 5 blogs on the network.--setup
will clear the index first and re-send the put mapping.--posts-per-page
let's you determine the amount of posts to be indexed per bulk index (or cycle).--nobulk
let's you disable bulk indexing.--offset
let's you skip the first n posts (don't forget to remove the--setup
flag when resuming or the index will be emptied before starting again).--show-bulk-errors
displays the error message returned from Elasticsearch when a post fails to index (as opposed to just the title and ID of the post).--post-type
let's you specify which post types will be indexed (by default: all indexable post types are indexed). For example,--post-type="my_custom_post_type"
would limit indexing to only posts from the post type "my_custom_post_type". Accepts multiple post types separated by comma.--keep-active
let's you keep ElasticPress active during indexing (cannot be used with--setup
).
-
wp elasticpress activate
Turns on ElasticPress integration. Integration is automatically deactivated during indexing if
--keep-active
isn't passed or--setup
is passed. -
wp elasticpress deactivate
Turns off ElasticPress integration. Integration is automatically deactivated during indexing if
--keep-active
isn't passed or--setup
is passed. -
wp elasticpress delete-index [--network-wide]
Deletes the current blog index.
--network-wide
will force every index on the network to be deleted. -
wp elasticpress is-active
Checks whether ElasticPress is currently integration active. This is different than whether the plugin is WordPress active or not. During indexing, integration will be deactivated automatically.
-
wp elasticpress put-mapping [--network-wide]
Sends plugin put mapping to the current blog index.
--network-wide
will force mappings to be sent for every index in the network. -
wp elasticpress recreate-network-alias
Recreates the alias index which points to every index in the network.
-
wp elasticpress stats
Returns basic stats on Elasticsearch instance i.e. number of documents in current index as well as disk space used.
-
wp elasticpress status
-
ElasticPress can be used with the Elasticsearch Shield plugin
- Define the constant
ES_SHIELD
in yourwp-config.php
file with the username and password of your Elasticsearch Shield user. For example:
- Define the constant
define( 'ES_SHIELD', 'username:password' );
Follow the configuration instructions above to setup the plugin.
Within the terminal change directories to the plugin folder. Initialize your testing environment by running the following command:
For VVV users:
bash bin/install-wp-tests.sh wordpress_test root root localhost latest
For VIP Quickstart users:
bash bin/install-wp-tests.sh wordpress_test root '' localhost latest
where:
wordpress_test
is the name of the test database (all data will be deleted!)root
is the MySQL user nameroot
is the MySQL user password (if you're running VVV). Blank if you're running VIP Quickstart.localhost
is the MySQL server hostlatest
is the WordPress version; could also be 3.7, 3.6.2 etc.
Our test suite depends on a running Elasticsearch server. You can supply a host to PHPUnit as an environmental variable like so:
EP_HOST="http://192.168.50.4:9200" phpunit
We have a Debug Bar Plugin available for ElasticPress. This tool allows you to examine all the ElasticPress queries on each page load.
If you identify any errors or have an idea for improving the plugin, please open an issue. We're excited to see what the community thinks of this project, and we would love your input!
ElasticPress is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.