xorbits.pandas.read_orc#

xorbits.pandas.read_orc(path: FilePath | ReadBuffer[bytes], columns: list[str] | None = None, dtype_backend: DtypeBackend | lib.NoDefault = _NoDefault.no_default, **kwargs) DataFrame[source]#

Load an ORC object from the file path, returning a DataFrame.

Parameters
  • path (str, path object, or file-like object) – String, path object (implementing os.PathLike[str]), or file-like object implementing a binary read() function. The string could be a URL. Valid URL schemes include http, ftp, s3, and file. For file URLs, a host is expected. A local file could be: file://localhost/path/to/table.orc.

  • columns (list, default None) – If not None, only these columns will be read from the file. Output always follows the ordering of the file and not the columns list. This mirrors the original behaviour of .

  • dtype_backend ({"numpy_nullable", "pyarrow"}, defaults to NumPy backed DataFrames) –

    Which dtype_backend to use, e.g. whether a DataFrame should have NumPy arrays, nullable dtypes are used for all dtypes that have a nullable implementation when “numpy_nullable” is set, pyarrow is used for all dtypes if “pyarrow” is set.

    The dtype_backends are still experimential.

    New in version 2.0(pandas).

  • **kwargs – Any additional kwargs are passed to pyarrow.

Return type

DataFrame

Notes

Before using this function you should read the user guide about ORC and install optional dependencies.

Warning

This method has not been implemented yet. Xorbits will try to execute it with pandas.

This docstring was copied from pandas.