**------------------------------------------------------------------------------------------------
* @header_start
* WebGrab+Plus ini for grabbing EPG data from TvGuide websites
* @Site: telkku.com
* @MinSWversion:
* @Revision 0 - [20/05/2014] Jan van Straaten
*   - creation
* @Remarks:
* @header_end
**------------------------------------------------------------------------------------------------
*
site {url=www.telkku.com|timezone=UTC+02:00|maxdays=31|cultureinfo=fi-FI|charset=iso-8859-1|titlematchfactor=90}
site {episodesystem=onscreen}
*
urldate.format {datestring|yyyyMMdd}
url_index{url|http://www.telkku.com/channel/list/|channel|/|urldate|} *http://www.telkku.com/channel/list/1/20150910
url_index.headers {customheader=Accept-Encoding=gzip,deflate} * to speed up the downloading of the index pages
*
index_showsplit.scrub {regex||()||}
index_start.scrub {regex||
(\d{2}:\d{2})||}
index_title.scrub {regex||
\d{2}:\d{2}\s(.+?)
||}
index_description.scrub {regex||
\s+?
(.*?)
||}
index_episode.scrub {regex||\s+?
(.*?)
||}
index_description.modify {remove|'index_episode'
}
index_director.modify {substring(type=regex)|'index_description' "Ohjaajana:\s(.+?)\."}
index_description.modify {remove(type=regex)|"(Ohjaajana:\s.+?\.)"}
index_director.modify {replace|,|\|}
index_actor.modify {substring(type=regex)|'index_description' "Pääosissa:\s(.+?)\."}
index_description.modify {remove(type=regex)|"(Pääosissa:\s.+?\.)"}
index_actor.modify {replace|,|\|}
index_productiondate.modify {substring(type=regex)|'index_description' "(\d{4})\.\z"}
index_description.modify {remove(type=regex)|"(\d{4}\.\z)"}
index_description.modify {cleanup}
*
* the showdetail page only adds the starrating
index_urlshow {url||
|">}
index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate} * to speed up the downloading of the detail pages
*
title.modify {addstart|'index_title'}
starrating.scrub {regex||type=\"radio\" class=\"auto-submit-star checked\" value=\"(\d{1})\"||}
** _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
** ##### CHANNEL FILE CREATION (only to create the xxx-channel.xml file)
**
** @auto_xml_channel_start
*index_site_id.scrub {regex||(.+?)||}
*scope.range {(channellist)|end}
*index_site_id.modify {cleanup(removeduplicates=equal,100 link="index_site_channel")}
*end_scope
** @auto_xml_channel_end