Changes for version v0.37.0 - 2020-08-08

  • Add a site-specific extractor for www.penghutimes.com
  • dateline is reformatted differently. The time component is no longer default to 23:59:59

Modules

download and extract news articles from Internet.
A data class for containing news article.

Provides

in lib/NewsExtractor/CSSExtractor.pm
in lib/NewsExtractor/CSSRuleSet.pm
in lib/NewsExtractor/Constants.pm
in lib/NewsExtractor/Download.pm
in lib/NewsExtractor/Error.pm
in lib/NewsExtractor/Extractor.pm
in lib/NewsExtractor/GenericExtractor.pm
in lib/NewsExtractor/JSONLDExtractor.pm
in lib/NewsExtractor/Role/ContentTextExtractor.pm
in lib/NewsExtractor/SiteSpecificExtractor.pm
in lib/NewsExtractor/SiteSpecificExtractor/ChinaTimes.pm
in lib/NewsExtractor/SiteSpecificExtractor/ETtoday.pm
in lib/NewsExtractor/SiteSpecificExtractor/UDN.pm
in lib/NewsExtractor/SiteSpecificExtractor/ctee_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/estate_ltn_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/focustaiwan_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/hk_crntt_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/hk_on_cc.pm
in lib/NewsExtractor/SiteSpecificExtractor/m_news_cctv_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/money_udn_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/new_ctv_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/newnet_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_cctv_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_cts_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_ebc_net_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_pts_org_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_tnn_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/news_tvbs_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/newtalk_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/talk_ltn_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/turnnewsapp_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_allnews_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_bcc_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_digitimes_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_epochtimes_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_fountmedia_io.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_hkcna_hk.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_hkcnews_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_idn_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ksnews_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_mdnkids_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_nownews_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ntdtv_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_penghutimes_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_peopo_org.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_rti_org_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_rvn_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_setn_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_taipeitimes_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_thestandnews_com.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ttv_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_twreporter_org.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_upmedia_mg.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_ustv_com_tw.pm
in lib/NewsExtractor/SiteSpecificExtractor/www_xinhuanet_com.pm
in lib/NewsExtractor/TXExtractor.pm
in lib/NewsExtractor/TextUtil.pm
in lib/NewsExtractor/Types.pm