Changes for version v0.17.0 - 2020-05-03

  • Improve the extraction of journalist names on cnews, EBC, CTEE and SETN
  • Add a site-specific extractor for newnet.tw

Modules

Download and extract news articles from the Internet.
A data class for containing a news article.

Provides

in lib/NewsExtractor/CSSExtractor.pm
in lib/NewsExtractor/CSSRuleSet.pm
in lib/NewsExtractor/Constants.pm
in lib/NewsExtractor/Download.pm
in lib/NewsExtractor/Error.pm
in lib/NewsExtractor/Extractor.pm
in lib/NewsExtractor/GenericExtractor.pm
in lib/NewsExtractor/JSONLDExtractor.pm
in lib/NewsExtractor/SiteSpecificExtractor.pm
in lib/NewsExtractor/SiteSpecificExtractor/ChinaTimes.pm
in lib/NewsExtractor/SiteSpecificExtractor/ETtoday.pm
in lib/NewsExtractor/SiteSpecificExtractor/UDN.pm
in lib/NewsExtractor/SiteSpecificExtractor/ctee_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/estate_ltn_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/money_udn_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/newnet_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_cts_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_ebc_net_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_tnn_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_tvbs_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/turnnewsapp_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_allnews_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_bcc_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ksnews_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ntdtv_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_peopo_org.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_rti_org_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_rvn_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_setn_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_taipeitimes_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_upmedia_mg.pm
in lib/NewsExtractor/TXExtractor.pm
in lib/NewsExtractor/TextUtil.pm
in lib/NewsExtractor/Types.pm