Datasets

General conference datasets. data("genconf")

genconf

General Conference Data

Scrape

Scrape general conference talk URLs and contents. Any function with scrape as a prefix will execute an rvest::read_html() call (i.e., visit the web page).

scrape_conference_html_doc_map()

Scrape HTML doc map from Conference URL

scrape_conference_talks()

Scrape all conference talks for a session

scrape_conference_urls()

Main function to scrape all conference talk URLs. For a given year-month conference, returns a nested tibble of all sessions, with a tibble-column containing the dataframes.

scrape_talk()

Scrape general conference talk
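
As a rough illustration of the "nested tibble" shape described for scrape_conference_urls(), the sketch below builds a small tibble by hand with one row per session and a list-column of per-session dataframes. The column names (session, talks, title, url), the example values, and the assumption that each nested dataframe holds talks are hypothetical; the function's real output may differ.

```r
library(tibble)

# Hypothetical per-session dataframes; names and values are placeholders,
# not output from the package.
talks_sat <- tibble(
  title = c("Talk A", "Talk B"),
  url   = c("https://example.org/a", "https://example.org/b")
)
talks_sun <- tibble(
  title = "Talk C",
  url   = "https://example.org/c"
)

# One row per session; the list-column `talks` holds a dataframe per row.
sessions <- tibble(
  session = c("Saturday Morning", "Sunday Morning"),
  talks   = list(talks_sat, talks_sun)
)

# Count the rows of each nested dataframe.
n_talks <- vapply(sessions$talks, nrow, integer(1))
```

A list-column keeps each session's rows together, so a per-session function can be mapped over the column without first flattening everything into one long table.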

Utilities

extract_*() functions pull HTML content out of an rvest::read_html() document. parse_*() functions manipulate strings and similar values, and don’t require a read_html() document.
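
The distinction can be sketched with plain rvest calls rather than the package's own helpers, whose signatures are not shown here. The inline HTML, the CSS selectors, and the variable names are assumptions made for illustration. rvest::minimal_html() stands in for a read_html() result so that no web page is visited.

```r
library(rvest)

# Hypothetical HTML standing in for a talk page; built locally, no network call.
doc <- minimal_html('
  <h1>Example talk</h1>
  <div><p>First paragraph.</p><p>Second paragraph.</p></div>
  <a href="/study/general-conference/2021/04/11example">Talk</a>
')

# extract-style steps: these need the parsed HTML document.
paragraphs <- html_text2(html_elements(doc, "p"))
hrefs      <- html_attr(html_elements(doc, "a"), "href")

# parse-style step: pure string manipulation, no document required.
talk_name <- basename(hrefs)
```

Keeping string handling separate from document handling means the parse-style step can be tested on plain character vectors.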

extract_body_paragraphs_df()

Produce paragraphs

extract_element()

Extract html document elements

extract_session_hrefs()

Extract Session hrefs

extract_url_from_rv_doc()

Extract url from rv_doc

parse_path_for_name()

Parse path for name

parse_session_urls()

Parse Session URLs

parse_url()

Scrape an individual General Conference URL