DatasetClient
Index
Methods
create_items_public_url
Generate a URL that can be used to access dataset items.
If the client has permission to access the dataset's URL signing key, the URL will include a signature to verify its authenticity.
You can optionally control how long the signed URL should be valid using the
expires_in_secsoption. This value sets the expiration duration in seconds from the time the URL is generated. If not provided, the URL will not expire.Any other options (like
limitoroffset) will be included as query parameters in the URL.Parameters
keyword-onlyoffset: int | None = None
keyword-onlylimit: int | None = None
keyword-onlyclean: bool | None = None
keyword-onlydesc: bool | None = None
keyword-onlyfields: list[str] | None = None
keyword-onlyomit: list[str] | None = None
keyword-onlyunwind: list[str] | None = None
keyword-onlyskip_empty: bool | None = None
keyword-onlyskip_hidden: bool | None = None
keyword-onlyflatten: list[str] | None = None
keyword-onlyview: str | None = None
keyword-onlyexpires_in_secs: int | None = None
Returns str
delete
Delete the dataset.
https://docs.apify.com/api/v2#/reference/datasets/dataset/delete-dataset
Returns None
download_items
Get the items in the dataset as raw bytes.
Deprecated: this function is a deprecated alias of
get_items_as_bytes. It will be removed in a future version.https://docs.apify.com/api/v2#/reference/datasets/item-collection/get-items
Parameters
keyword-onlyitem_format: str = 'json'
keyword-onlyoffset: int | None = None
keyword-onlylimit: int | None = None
keyword-onlydesc: bool | None = None
keyword-onlyclean: bool | None = None
keyword-onlybom: bool | None = None
keyword-onlydelimiter: str | None = None
keyword-onlyfields: list[str] | None = None
keyword-onlyomit: list[str] | None = None
keyword-onlyunwind: list[str] | None = None
keyword-onlyskip_empty: bool | None = None
keyword-onlyskip_header_row: bool | None = None
keyword-onlyskip_hidden: bool | None = None
keyword-onlyxml_root: str | None = None
keyword-onlyxml_row: str | None = None
keyword-onlyflatten: list[str] | None = None
keyword-onlysignature: str | None = None
Returns bytes
get
Retrieve the dataset.
https://docs.apify.com/api/v2#/reference/datasets/dataset/get-dataset
Returns dict | None
get_items_as_bytes
Get the items in the dataset as raw bytes.
https://docs.apify.com/api/v2#/reference/datasets/item-collection/get-items
Parameters
keyword-onlyitem_format: str = 'json'
keyword-onlyoffset: int | None = None
keyword-onlylimit: int | None = None
keyword-onlydesc: bool | None = None
keyword-onlyclean: bool | None = None
keyword-onlybom: bool | None = None
keyword-onlydelimiter: str | None = None
keyword-onlyfields: list[str] | None = None
keyword-onlyomit: list[str] | None = None
keyword-onlyunwind: list[str] | None = None
keyword-onlyskip_empty: bool | None = None
keyword-onlyskip_header_row: bool | None = None
keyword-onlyskip_hidden: bool | None = None
keyword-onlyxml_root: str | None = None
keyword-onlyxml_row: str | None = None
keyword-onlyflatten: list[str] | None = None
keyword-onlysignature: str | None = None
Returns bytes
get_statistics
Get the dataset statistics.
https://docs.apify.com/api/v2#tag/DatasetsStatistics/operation/dataset_statistics_get
Returns dict | None
iterate_items
Iterate over the items in the dataset.
https://docs.apify.com/api/v2#/reference/datasets/item-collection/get-items
Parameters
keyword-onlyoffset: int = 0
keyword-onlylimit: int | None = None
keyword-onlyclean: bool | None = None
keyword-onlydesc: bool | None = None
keyword-onlyfields: list[str] | None = None
keyword-onlyomit: list[str] | None = None
keyword-onlyunwind: list[str] | None = None
keyword-onlyskip_empty: bool | None = None
keyword-onlyskip_hidden: bool | None = None
keyword-onlysignature: str | None = None
Returns Iterator[dict]
list_items
List the items of the dataset.
https://docs.apify.com/api/v2#/reference/datasets/item-collection/get-items
Parameters
keyword-onlyoffset: int | None = None
keyword-onlylimit: int | None = None
keyword-onlyclean: bool | None = None
keyword-onlydesc: bool | None = None
keyword-onlyfields: list[str] | None = None
keyword-onlyomit: list[str] | None = None
keyword-onlyunwind: list[str] | None = None
keyword-onlyskip_empty: bool | None = None
keyword-onlyskip_hidden: bool | None = None
keyword-onlyflatten: list[str] | None = None
keyword-onlyview: str | None = None
keyword-onlysignature: str | None = None
Returns ListPage
push_items
Push items to the dataset.
https://docs.apify.com/api/v2#/reference/datasets/item-collection/put-items
Parameters
items: JSONSerializable
Returns None
stream_items
Retrieve the items in the dataset as a stream.
https://docs.apify.com/api/v2#/reference/datasets/item-collection/get-items
Parameters
keyword-onlyitem_format: str = 'json'
keyword-onlyoffset: int | None = None
keyword-onlylimit: int | None = None
keyword-onlydesc: bool | None = None
keyword-onlyclean: bool | None = None
keyword-onlybom: bool | None = None
keyword-onlydelimiter: str | None = None
keyword-onlyfields: list[str] | None = None
keyword-onlyomit: list[str] | None = None
keyword-onlyunwind: list[str] | None = None
keyword-onlyskip_empty: bool | None = None
keyword-onlyskip_header_row: bool | None = None
keyword-onlyskip_hidden: bool | None = None
keyword-onlyxml_root: str | None = None
keyword-onlyxml_row: str | None = None
keyword-onlysignature: str | None = None
Returns Iterator[impit.Response]
update
Update the dataset with specified fields.
https://docs.apify.com/api/v2#/reference/datasets/dataset/update-dataset
Parameters
keyword-onlyname: str | None = None
keyword-onlygeneral_access: StorageGeneralAccess | None = None
Returns dict
Sub-client for manipulating a single dataset.