TV.COM gives me an error when grabbing the times:
[ Info ] channel (xmltv_id=HBO HDTV (East) (HBOHD)) site -- TV.COM -- mode incremental
[ Error ] time parsing error : String was not recognized as a valid DateTime.
[ Error ] stoptime time scrubbed : 1446741000
[ Error ] computer date/time format: 05/11/2015 05:24:01 p.m.
[ Error ] execution stopped
It has something to do with:
index_start.scrub {regex||data-start="(\d{10})">||}
Disable that option and it works (though with no programme start/end times).
I didn't realize the ini needs the latest beta to work, i.e. a version >= the one in (* @MinSWversion: 1.1.1/55.27)
it's all ok now
my bad :(
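For anyone reading the log above: the scrubbed value 1446741000 is the 10-digit number captured by (\d{10}), which looks like a Unix timestamp (seconds since 1970-01-01 UTC). A minimal Python sketch of that conversion, only to show the scrubbed value itself is sane; the function name epoch_to_utc is made up here, and this is not how WebGrab+Plus does it internally:

```python
from datetime import datetime, timezone

def epoch_to_utc(seconds: str) -> datetime:
    """Interpret a scrubbed 10-digit string as Unix epoch seconds (assumption)."""
    return datetime.fromtimestamp(int(seconds), tz=timezone.utc)

# Value taken from the "stoptime time scrubbed" log line.
print(epoch_to_utc("1446741000").isoformat())  # 2015-11-05T16:30:00+00:00
```

That lands on the same day as the computer date in the log, so the scrub itself found a plausible value; the failure was in how the older program version parsed it.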
Same issue here. I do have the correct WebGrab.exe version running though
EDIT:
It all works with a version higher than the minimum software version. The latest beta works: V1.56.8
Hi, I'm not able to grab EPG data from TV.com, I'm getting the following error message (I have the latest update installed):
Please let me know if there is anything I may be doing wrong here.
Thank you.
It works now with no issues, except that all the movie and TV show titles have slashes in them, e.g. "/Back\/to\/the\/\Future/\"
Do you know why this could be happening? It would be nice to avoid having these slashes in the middle of the movie or TV show title.
Thanks in advance.
I don't know how to work with regular expressions but I'm more than sure that this issue has to do with either one of the following lines from the ini file:
index_start.scrub {regex||data-start="(\d{10})">||}
index_title.scrub {regex||<div class="title">(.+?)</div>||}
index_description.scrub {regex||<div class="desc">(.*?)</div>||}
*
index_temp_1.scrub {regex||data-tmsid="rvp:(\d+?)"||} * id
index_urlshow.modify {set('index_temp_1' not "")|http://www.tv.com/listings/event/?EventTmsId=rvp%3A'index_temp_1'}
http://www.tv.com/listings/event/?EventTmsId=rvp%3A1952005171
index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate}* to speedup the downloading of the detail pages
title.scrub {regex||<h1>(?:<a href=.+?>)?(.+?)(?:</a>)?</h1>||}
subtitle.scrub {regex||<h2>(?:<a href=.+?>)?(.+?)(?:</a>)?</h2>||}
description.scrub {regex||<div class=\\"description\\">(.*?)</div>||}
description.modify {remove|\}
showicon.scrub {regex||data-image=\\"(http://.+?\.jpg)\\"||}
category.modify {substring(type=regex)|'title' "<label>(.+?):\s?</label>"}
title.modify {remove(type=regex)|"(<label>.+?</label>)"}
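As far as I understand the last two lines, the first copies the text inside a leading <label>...</label> into category, and the second removes that label from the title. A rough Python sketch of the same idea, reusing the two regexes from the ini; the sample title string and the function name split_label are invented for illustration, and the exact WebGrab+Plus semantics are an assumption on my part:

```python
import re

def split_label(title: str) -> tuple[str, str]:
    """Return (clean_title, category) for a title that may carry a <label> prefix."""
    # Same pattern as in category.modify: capture the label text before the colon.
    m = re.search(r"<label>(.+?):\s?</label>", title)
    category = m.group(1) if m else ""
    # Same pattern as in title.modify: drop the whole label element.
    clean = re.sub(r"(<label>.+?</label>)", "", title)
    return clean, category

# Hypothetical scrubbed title, not taken from a real page.
print(split_label("<label>Movie: </label>Transformers"))
```

A title without a label comes back unchanged with an empty category.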
works well here:
there's a difference between my ini and yours
yours: index_start.scrub {regex||data-start="(\d{10})">||}
mine: index_start.scrub {regex||data-start="(\d{10})"||}
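One possible reason the trailing > matters (an assumption, I haven't checked the current page source): if the page puts another attribute after data-start, the version ending in "> can no longer match. A small Python sketch with a made-up HTML snippet; the data-end attribute and the helper name scrub are hypothetical:

```python
import re

WITH_BRACKET = r'data-start="(\d{10})">'   # "yours"
WITHOUT_BRACKET = r'data-start="(\d{10})"'  # "mine"

def scrub(pattern: str, html: str):
    """Return the first captured group, or None when the pattern finds nothing."""
    m = re.search(pattern, html)
    return m.group(1) if m else None

# Hypothetical markup where another attribute follows data-start.
sample = '<li data-start="1446741000" data-end="1446744600">'

print(scrub(WITH_BRACKET, sample))     # None
print(scrub(WITHOUT_BRACKET, sample))  # 1446741000
```

When the tag closes right after data-start, both patterns give the same result, so dropping the > is the more tolerant choice.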
Thanks for your comment. It did improve things for a few channels, but I'm still seeing the following characters before the actual name of the movie. This only happens on movie channels such as HBO, Showtime, Starz, etc.
I've executed WG++ many times, but the outcome has always been the same: "<a href=\"/movies/transformers/\">Transformers" as the title.
I would appreciate it if anyone could share their thoughts on how to fix this issue.
Thanks.
dirty fix made:
The problem is that the title comes in two different forms, one for movies and another for regular shows. A regex comes in handy here, but I'm not good at that, so regex experts, do your best to fix it.
In the meantime you can use this ini. There are a lot of (?) in titles (it differs); I read somewhere in this forum about a fix for that (something like: title.modify prefer blah blah...)
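To illustrate the two title forms, here is a minimal Python sketch that runs the title.scrub regex quoted earlier in this thread against two made-up <h1> samples: one wrapped in a link (movie style, with the escaped quotes seen in the reported title) and one plain. The sample strings and the helper name scrub_title are hypothetical; WebGrab+Plus uses its own regex engine, but these basic constructs (optional non-capturing group, lazy quantifier) should behave the same way:

```python
import re

# Pattern copied from the title.scrub line posted above.
TITLE_RE = re.compile(r"<h1>(?:<a href=.+?>)?(.+?)(?:</a>)?</h1>")

def scrub_title(html: str):
    """Return the title text with an optional surrounding <a> tag skipped."""
    m = TITLE_RE.search(html)
    return m.group(1) if m else None

# Hypothetical samples for the two forms.
movie = r'<h1><a href=\"/movies/transformers/\">Transformers</a></h1>'
show = "<h1>Some Show</h1>"

print(scrub_title(movie))  # Transformers
print(scrub_title(show))   # Some Show
```

The optional (?:<a href=.+?>)? group swallows the link when it is there and is skipped when it is not, so one pattern covers both cases on these samples.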
I've adjusted the siteini. Should be fine now.
mighty Francis did it
here's another regex solution (just for study)
Thank you both. I will try this today and let you know the outcome.