Theundatasio
package from UnDatasIO extracts clean text from raw source documents like PDFs. This page covers how to use theundatasio
ecosystem within LangChain.
Installation and Setup
-
Install the Python SDK with
pip install undatasio
along with
pip install langchain-undatasio
to use theUnDatasIOLoader
and partition remotely against the UnDatasIO API. You will need an API key, which you can generate for free at
undatas.io. - No local system dependencies are required; all processing runs in the cloud.
Data Loaders
The primary usage ofUnDatasIO
is through the document loader.