Package: PDFR 0.1.0
PDFR: Extract Text From PDFs In An R Friendly Way
Extracts text from PDF into an R dataframe giving the content, size, position and font of any text elements. This information can then be manipulated in R.
Authors:
PDFR_0.1.0.tar.gz
PDFR_0.1.0.zip(r-4.5)PDFR_0.1.0.zip(r-4.4)PDFR_0.1.0.zip(r-4.3)
PDFR_0.1.0.tgz(r-4.4-x86_64)PDFR_0.1.0.tgz(r-4.4-arm64)PDFR_0.1.0.tgz(r-4.3-x86_64)PDFR_0.1.0.tgz(r-4.3-arm64)
PDFR_0.1.0.tar.gz(r-4.5-noble)PDFR_0.1.0.tar.gz(r-4.4-noble)
PDFR_0.1.0.tgz(r-4.4-emscripten)PDFR_0.1.0.tgz(r-4.3-emscripten)
PDFR.pdf |PDFR.html✨
PDFR/json (API)
NEWS
# Install 'PDFR' in R: |
install.packages('PDFR', repos = c('https://elipousson.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/allancameron/pdfr/issues
data-scientistsextract-textpdfpdf-format
Last updated 2 years agofrom:955c122cd0. Checks:OK: 1 NOTE: 4 WARNING: 4. Indexed: no.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Oct 22 2024 |
R-4.5-win-x86_64 | NOTE | Oct 22 2024 |
R-4.5-linux-x86_64 | NOTE | Oct 22 2024 |
R-4.4-win-x86_64 | NOTE | Oct 22 2024 |
R-4.4-mac-x86_64 | WARNING | Oct 22 2024 |
R-4.4-mac-aarch64 | WARNING | Oct 22 2024 |
R-4.3-win-x86_64 | NOTE | Oct 22 2024 |
R-4.3-mac-x86_64 | WARNING | Oct 22 2024 |
R-4.3-mac-aarch64 | WARNING | Oct 22 2024 |
Exports:draw_glyphget_objectget_xrefgetglyphmapgetpagestringpdfboxespdfdocpdfgraphicspdfgrobspdfpagepdfplotpdfr_paths
Dependencies:briocallrclicrayondescdiffobjdigestevaluatefansifsgluejsonlitelifecyclemagrittrpillarpkgbuildpkgconfigpkgloadpraiseprocessxpsR6Rcpprematch2rlangrprojroottestthattibbleutf8vctrswaldowithr
Readme and manuals
Help Manual
Help page | Topics |
---|---|
draw_glyph | draw_glyph |
Get the contents of a pdf object | get_object |
Get a pdf's xref table as an R dataframe | get_xref |
Return map of glyphs from a page | getglyphmap |
pagestring | getpagestring |
pdfboxes | pdfboxes |
pdfdoc | pdfdoc |
pdfgraphics | pdfgraphics |
pdfgrobs | pdfgrobs |
pdfpage | pdfpage |
pdfplot | pdfplot |
Paths to test pdfs | pdfr_paths |
A tool used for symbol registration | run_testthat_tests |