Bajo los adoquines, la PLAYA 🏖️
Find a file
2026-04-13 07:16:00 -04:00
.github/workflows fix(ci): stupid git fix for deploy docs 2026-02-17 09:10:12 -05:00
benchmarks feat: refactor detr detectors together 2026-01-14 08:16:20 -05:00
docs feat!: switch to playa TextBase 2026-01-08 23:16:38 -05:00
src/paves fix: docling labels changed for no reason 2026-04-13 07:16:00 -04:00
stubs/pypdfium2 feat: bring bears in-house 2025-02-16 23:07:34 -05:00
tests feat!: switch to playa TextBase 2026-01-08 23:16:38 -05:00
.flake8 feat: parallel bears 2024-12-30 12:27:01 -05:00
.gitignore refactor: create packages for image, tables, text 2026-01-13 04:48:44 -05:00
CHANGELOG.md docs: changelog 2026-02-17 09:01:16 -05:00
LICENSE.txt chore: initial import 2024-11-27 21:38:38 -05:00
mkdocs.yml feat!: remove bears and mostly remove miner 2025-12-17 18:14:57 -05:00
pyproject.toml fix: correct for PLAYA 1.0.0 where bbox could be None 2026-02-17 08:56:14 -05:00
README.md feat: add a uv.lock 2026-01-13 07:48:35 -05:00
uv.lock chore: lock 2026-02-17 07:17:31 -05:00

PAVÉS: Bajo los adoquines, la PLAYA 🏖️

PLAYA is intended to get objects out of PDF, with no dependencies or further analysis. So, over top of PLAYA, this package provides PDF, Analyse et Visualisation simplifiÉeS.

Or, if you prefer, PDF Analysis and Visualization for dummiES.

The goal here is not to provide elaborate, enterprise-grade, battle-tested, cloud and AI-native, completely configurable and confoundingly complex classes for ETL. It's to give you some helpful functions that you can use to poke around in PDFs and get useful things out of them, often but not exclusively in the context of a Jupyter notebook.

See the documentation for more information. There will also be some helpful notebooks soon, to help you.

Installation

Install it from PyPI (as paves) with pip or uv, preferably in a virtual environment. That's all. If you want to play around in the source code you can use hatch or uv (your choice), for instance:

# with hatch
hatch shell
# with uv
uv venv
uv sync
. .venv/bin/activate

License

PAVÉS is distributed under the terms of the MIT license.