I found that the tvguide.co.uk ini was no longer working because of site changes, so I thought I would try to help by making an updated ini file.
Basically, the detail page now has a new name and format, so the grab was only returning summary information.
The initial change was at line 122:
now --- index_urlshow.scrub {regex||<a href=\"(http\://www.tvguide.co.uk/detail/.+?)\"||}
was --- *index_urlshow.scrub {regex||<a href=\"(http\://watch.tvguide.co.uk/engage/.+?)\"||}
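As a rough illustration of what that change does, here is a small Python sketch that applies the same pattern as the new `index_urlshow.scrub` line. The HTML fragment and the paths in it are made up for the example and are not taken from the real site. The dots are escaped here, which the ini line does not do.

```python
import re

# Same pattern as the new index_urlshow.scrub line, rewritten for Python.
# The ini file escapes the quotes and the colon; Python does not need that.
DETAIL_LINK = re.compile(r'<a href="(http://www\.tvguide\.co\.uk/detail/.+?)"')

# Hypothetical index-page fragment, invented for this illustration only.
sample_html = (
    '<a href="http://www.tvguide.co.uk/detail/12345/example-show">Example</a>'
    '<a href="http://watch.tvguide.co.uk/engage/67890">Old style link</a>'
)

# Only links on the new /detail/ path are captured; the old /engage/ link is ignored.
detail_urls = DETAIL_LINK.findall(sample_html)
print(detail_urls)
```

The lazy `.+?` stops at the first closing quote, so each capture is a single URL rather than a run of page content.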
It is now getting the certificate and details of actors etc. However, the actor entries on the website contain a whole mix of presenters and other roles.
I am getting stuck on how to extract from <actor> those entries that are presenters etc. and put them into the <presenter>, <commentator>, ... tags.
It seems like that should be simple?! Help appreciated, and if there are better ways of doing what I have done so far, please point them out to me.
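To make the goal concrete, here is a minimal Python sketch of the sorting logic. It is not WebGrab+ ini syntax. It assumes, purely for illustration, that a role follows the name in brackets, e.g. "Jane Doe (Presenter)". The real site format may differ, and the names and the `ROLE_TO_TAG` mapping are hypothetical.

```python
import re
from collections import defaultdict

# ASSUMPTION: a role, when present, follows the name in brackets.
ROLE_SUFFIX = re.compile(r'^(?P<name>.+?)\s*\((?P<role>[^)]+)\)\s*$')

# Hypothetical mapping from role text to an XMLTV credits tag.
ROLE_TO_TAG = {
    "presenter": "presenter",
    "commentator": "commentator",
    "director": "director",
}

def split_credits(entries):
    """Sort raw credit entries into XMLTV tags; unknown roles stay as actor."""
    credits = defaultdict(list)
    for entry in entries:
        text = entry.strip()
        match = ROLE_SUFFIX.match(text)
        tag = ROLE_TO_TAG.get(match.group("role").strip().lower()) if match else None
        if tag:
            # Recognised role: store the bare name under the matching tag.
            credits[tag].append(match.group("name").strip())
        else:
            # No role, or a role we do not map: leave the entry under actor unchanged.
            credits["actor"].append(text)
    return dict(credits)

print(split_credits(["Jane Doe (Presenter)", "John Smith", "Alex Roe (Commentator)"]))
```

Anything in brackets that is not in the mapping (a character name, for example) is left under actor untouched, so nothing is lost if the assumption about the format is wrong.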
Hopefully the ini file can then be updated on the website for others to use.
The ini file, config file and zipped tvguide.xml output file are attached.
(webgrab+ version 1.1.1/55.27 on Windows 10)
Thanks for the quick response. I have downloaded the updated info and will look at what you have done and aim to learn from it.
I will make an adjustment for my own use with MediaPortal, as it does not make use of the <actor>, <rating> etc. tags in the TV guide, so I will add these to the description so that they show up in the MediaPortal TV guide.
The category part of this has multiple entries; I want to reduce the large number of categories and make it a single entry.
i.e. currently it has the following:
<category>General Movie</category>
<category>Drama</category>
And I want the first entry only.
The current ini file has this for the category:
index_category.scrub {regex ||Category </span><span class=\"programmetext\">(.*?)</span></a><br>||}
index_category.modify {remove|/Factual Topics}
index_category.modify {remove|/Folkloric}
index_category.modify {remove|/Environment}
index_category.modify {replace|/|\|}
Is it easy to make it a single category of just the first entry?
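In plain terms, the wanted result is "everything before the first separator". A minimal Python sketch of that logic follows. It is not ini syntax, and it assumes the scrubbed value looks like "General Movie/Drama" before the final replace turns "/" into "|". The function name `first_category` is made up for the example.

```python
def first_category(raw):
    """Return only the first category from a '/'-separated string."""
    # Split once at the first "/" and keep the left-hand part.
    return raw.split("/", 1)[0].strip()

print(first_category("General Movie/Drama"))
```

If this were done in the ini itself, the "/Factual Topics", "/Folkloric" and "/Environment" remove lines would no longer be needed, since everything after the first "/" is dropped anyway.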
Many thanks again