The main problem is that Elasticsearch has no single call that returns every document in a given index. You can use the search API, but by default it returns only the first 10 hits.
I am using django-elasticsearch-dsl and I want to get a list of all documents in an index. How can I do that?
A:
You can use the Search object from django_elasticsearch_dsl (it is defined in django_elasticsearch_dsl.search). It's a subclass of the elasticsearch-dsl Search object, so you can use all the methods from there as well. The docs for that are here: https://elasticsearch-dsl.readthedocs.io/en/latest/search_dsl.html#the-search-object
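Since a single search request only returns one page of hits, listing a whole index comes down to requesting page after page until one comes back short. Here is a minimal sketch of that idea in plain Python. The names iter_all, fetch_page and fake_fetch are hypothetical and not part of any library; fake_fetch stands in for a real search call so the example runs without a cluster.

```python
from typing import Callable, Iterator, List, TypeVar

T = TypeVar("T")


def iter_all(fetch_page: Callable[[int, int], List[T]], page_size: int = 100) -> Iterator[T]:
    """Yield every item by requesting consecutive pages until one comes back short."""
    if page_size <= 0:
        raise ValueError("page_size must be positive")
    offset = 0
    while True:
        page = fetch_page(offset, page_size)
        yield from page
        # A short (or empty) page means the result set is exhausted.
        if len(page) < page_size:
            return
        offset += page_size


# Stand-in for an index: 250 fake documents (an assumption for illustration).
fake_index = [{"id": i} for i in range(250)]


def fake_fetch(offset: int, size: int) -> List[dict]:
    # With a real Search object `s`, this could instead be:
    #     return list(s[offset:offset + size])
    return fake_index[offset:offset + size]


docs = list(iter_all(fake_fetch, page_size=100))
print(len(docs))  # 250
```

With a real Search object you would add an explicit sort so the pages stay stable between requests. Note also that offset-based paging is capped by the index's max_result_window setting (10,000 by default), so for very large indices the scroll-based scan() method of the Search object is usually the better fit.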
<code>from django_elasticsearch_dsl.search import Search

# Assumes a default connection is already configured
# (django-elasticsearch-dsl sets one up from ELASTICSEARCH_DSL in settings.py).
s = Search(index='blog').query('match', title='python')
response = s.execute()

for hit in response:      # iterate over the hits in the response (first 10 by default)
    print(hit.to_dict())  # each hit is a Hit object; to_dict() gives a plain dict

for hit in s:             # iterating the Search executes it and yields the same hits
    print(hit.to_dict())

for hit in s[0:10]:       # slice the search to request only the first 10 hits
    print(hit.to_dict())

for hit in s.scan():      # scroll through every matching document, not just one page
    print(hit.to_dict())

s.count()                    # total number of documents matching the query
len(response)                # number of hits contained in this response only
list(s)[0].to_dict()         # first result as a Python dictionary
response[0]                  # first result as an Elasticsearch Hit
response[0].meta             # metadata associated with the Hit
response[0].meta.score       # score associated with the Hit
response[0].title            # title field value
list(response)[1]['title']   # second result's 'title' field value
</code>
The above code returns only the first page of hits matching your query (10 by default) unless you slice the search or use scan(). Each hit carries the full _source by default. To limit which fields are returned, use source():
<code>from pprint import pprint

from django_elasticsearch_dsl.search import Search

s = Search().query('match', title='python').source(['title', 'body'])
for hit in s:
    pprint(hit.to_dict())  # only the 'title' and 'body' fields are present
</code>
Or simply do it like below:
[{'body': 'Python and Django go together like peanut butter and jelly.'