Word Cloud LinksAnimation (3) CSP (2) Data (22) Data Science (3) Distributions (1) Dust (7) Economics (14) Engineering (9) Equipment Venders (1) Faster R (1) GDAL (5) Germany (1) ggplot2 (19) GIS (5) Irradiance (17) Kuwait (1) LaTeX (11) Linux (2) Meteorology (16) Misc Tricks (3) Modeling (6) Natural Gas (2) Nuclear (1) O&M (2) Projects (5) Project Valuations (6) Qatar RE (13) R Colors (9) R Data Import (6) R Data Objects (16) R Data Syntax (6) Renewable Energy (14) RE Policy (7) Resource Assessment (19) R Graphics (19) R Packages (3) R Programming (20) Saudi Arabia (2) Scientific Computing (1) Solar (35) Spatial Analysis (6) Storage (1) UAE (3) Ubuntu (1) Website (2) WECC (2) Wind (5)
Category Archives: Misc Tricks
A new method to extract data tables from PDF files is introduced. Most of the data scraping tools available are browser-based. The common tools are also manual in nature and limited to one table at a time. A solution is outlined to extract multiple tables at once. The solution combines the R programming language with the open-source Java program Tabula. The result is a convenient method that transforms documents into databases.
The ability to train a machine to extract data tables from PDF files has several benefits:
Beamer is a document class that is by far the most practical tool for making presentations involving data science, business analytics, or general research. It is widely used in most conferences and easily lends itself to data intensive reporting and repetitive batch processing.
A custom beamer template is presented that is easy to extend or modify. The benefits of the beamer document are numerous:
Pretty R is an online tool and r syntax highlighter that transforms R source code into HTLM code for website development. The result is easy to read R code for high quality web presentations. The Pretty R webpage is a good learning tool as it provides the HTML code details required to deliver syntax highlighting that complies with R documentation from inside-r.org.