cmoncrawl.processor.extraction.filters

Functions

must_exist_filter(soup, filter_list)

This function takes in a BeautifulSoup object and a list of CSS selectors.

must_not_exist_filter(soup, filter_list)

This function takes in a BeautifulSoup object and a list of CSS selectors.