Get Crawl
Check the crawl status to see when the crawl completes.
HTTP
POST /docs/get_crawl HTTP/1.1
Api-Key: my_api_key
Content-Type: application/json
Host: public.us-west-2.aws.nautilusdb.com
{
"namespace_name": "string",
"collection_name": "string",
"crawl_id": "string"
}
Response:
{
"root_url": "string",
"created_on": int,
"max_pages": int,
// crawl_status: 1 = RUNNING, 2 = SUCCEEDED, 3 = FAILED
"crawl_status": int,
"crawl_result": {
// error_code and error_msg is Optional, only set if crawling url1 fails
"url1": {"char_count": int, "error_code": "string" or null, "error_msg": "string" or null},
...
}
}
Python
import nautilusdb as ndb
# Configure ndb to use the API key
ndb.init(api_key="my_api_key")
col = ndb.collection('my_collection')
# check the crawl status
resp = col.get_crawl(crawl_id)
print(resp.crawl_status)