Solved: get a list of all documents in django-elasticsearch-dsl

The main problem is that Elasticsearch has no dedicated call that simply lists every document in a given index. You can use the search API, but by default it returns only the first 10 hits.

I'm using django-elasticsearch-dsl and want to get a list of all the documents in an index. How can I do that?

A:

You can use the Search object from django_elasticsearch_dsl. It is a subclass of the elasticsearch-dsl Search object, so all of that class's methods are available as well. The documentation is here: https://elasticsearch-dsl.readthedocs.io/en/latest/search_dsl.html#the-search-object

Getting all documents

<code>from django_elasticsearch_dsl.search import Search

# Build the query; nothing is sent until the search is executed or iterated
s = Search(index='blog').query('match', title='python')

response = s.execute()

for hit in response:  # iterate over the hits in the response object
    print(hit)  # each hit is a Hit object; call hit.to_dict() for a plain dict

for hit in s:  # iterating the search executes it and yields the hits directly
    print(hit)

for hit in s[0:10]:  # slice the search to request only the first 10 hits
    print(hit)

s.count()  # total number of documents matching the query (this is what you want)
len(response)  # number of hits in this response (10 by default), not the total
list(s)[0]  # first result as a Hit object
response[0]  # first result as a Hit object
response[0].meta  # metadata associated with the hit
response[0].meta.score  # relevance score of the hit
response[0].title  # value of the title field
list(response)[1]['title']  # title field of the second result
</code>

The code above returns the documents matching your query, up to the default page size of 10. Each hit carries the whole _source unless you narrow it with source(). To retrieve only specific fields, use source():

<code>import pprint

from django_elasticsearch_dsl.search import Search

pp = pprint.PrettyPrinter()

# Ask Elasticsearch to return only the title and body fields of each hit
s = Search().query('match', title='python').source(['title', 'body'])

for hit in s:
    pp.pprint(hit.to_dict())  # print each hit as a plain dict

## or simply do it like below:
## [{'body': 'Python and Django go together like peanut butter and jelly.'
</code>
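The snippets above only ever see one page of hits, so they do not yet list every document in the index. One way to do that is the scan() method of the elasticsearch-dsl Search object, which pages through all matches with the scroll API. The following is a minimal sketch rather than a definitive implementation: the helper names iter_all_documents and iter_index are hypothetical, the index name 'blog' is carried over from the example above, and it assumes a default connection is already configured (django-elasticsearch-dsl does this from the ELASTICSEARCH_DSL setting).

```python
from typing import Any, Dict, Iterator


def iter_all_documents(search: Any) -> Iterator[Dict[str, Any]]:
    """Yield every document reachable from `search` as a plain dict.

    `search` is assumed to behave like an elasticsearch-dsl Search object:
    scan() walks all matching documents and yields hits that expose
    to_dict() and meta.id.
    """
    for hit in search.scan():
        doc = hit.to_dict()       # the fields stored in _source
        doc["_id"] = hit.meta.id  # keep the document id next to the fields
        yield doc


def iter_index(index_name: str) -> Iterator[Dict[str, Any]]:
    """Hypothetical convenience wrapper: scan a whole index by name."""
    # Imported lazily so this module loads even without elasticsearch-dsl.
    from elasticsearch_dsl import Search

    # No .query(...) call is made, so the search matches every document.
    return iter_all_documents(Search(index=index_name))
```

With a reachable cluster, list(iter_index('blog')) would collect the whole index into memory; for large indices it is usually better to consume the generator one document at a time. Note that scan() returns documents in no particular order and without relevance scores.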
