site -- TV.COM not working

12 posts / 0 new

Last post

Thu, 2015-11-05 22:30

1NSdbZVbpZDX

Offline

Joined: 10 years

Last seen: 7 years

site -- TV.COM not working

TV.COM gives me error in time grabbing

[ Info ] channel (xmltv_id=HBO HDTV (East) (HBOHD)) site -- TV.COM -- mode incremental
[ Error ] time parsing error : String was not recognized as a valid DateTime.
[ Error ] stoptime time scrubbed : 1446741000
[ Error ] computer date/time format: 05/11/2015 05:24:01 p.m.
[ Error ] execution stopped

has to do spmething with :

index_start.scrub {regex||data-start="(\d{10})">||}

disable that option and it works (no programme start end tho')

Attachments:

WebGrab.log_.txt

Tue, 2015-11-10 10:39

1NSdbZVbpZDX

Offline

Joined: 10 years

Last seen: 7 years

didn't realize ini needs last beta for work >= (* @MinSWversion: 1.1.1/55.27)

it's all ok now

my bad :(

Mon, 2015-11-16 14:22

markdark

Offline

Joined: 9 years

Last seen: 9 years

Same issue here. I do have the correct WebGrab.exe version running though

EDIT:

It all does work with version higher than the min software version. The latest beta does work: V1.56.8

Thu, 2015-12-03 11:03

Mariano

Offline

Joined: 9 years

Last seen: 8 years

Hi, I'm not able to grab EPG data from TV.com, I'm getting the following error message (I have the latest update installed):

[ Debug ] WebGrab+Plus/w MDB & REX Postprocess -- version 1.1.1/55.27 -- Jan van Straaten
[ Debug ] --------------------------------------------------------------------------------
[ Debug ] job started at 03/12/2015 06:47:08 a.m.
[ Debug ]
[ Debug ] reading config file: C:\ProgramData\ServerCare\WebGrab\WebGrab++.config.xml
[ Debug ] loading timezone data
[ Debug ] embedded timezones source: WGconsole.WG.Common.timezonesdata.txt
[ Debug ] found: C:\ProgramData\ServerCare\WebGrab\tv.com.ini -- Revision 0
[ Debug ] reading cookie file : C:\ProgramData\ServerCare\WebGrab\tv.com.cookies.txt
[ Debug ]
[ Debug ] running on: Microsoft Windows NT 6.1.7601 Service Pack 1
[ Debug ] environment: 4.0.30319.1
[ Debug ] xmltv input file - C:\ProgramData\ServerCare\WebGrab\guide.xml - found
[ Debug ] 0 superfluous shows removed
[ Info ]
[ Info ] update requested for - 1 - out of - 1 - channels for 2 day(s)
[ Info ] update mode - set per individual channel
[ Info ]
[ Info ]
[ Info ] i=index .=same c=change g=gab r=replace n=new
[ Info ]
[ Debug ]
[ Info ] channel (xmltv_id=Spike TV) site -- TV.COM -- mode incremental
[ Error ] time parsing error : String was not recognized as a valid DateTime.
[ Error ] stoptime time scrubbed : 1449118800
[ Error ] computer date/time format: 12/03/2015 06:47:09 a.m.
[ Error ] execution stopped

Please let me know if there is anything I may be doing wrong here.

Thank you.

Attachments:

tv.com_.cookies.txt

WebGrab.log_.txt

Mon, 2015-12-28 16:20

Mariano

Offline

Joined: 9 years

Last seen: 8 years

It does work now with no issues, except that all the movie and TV shows titles are in between slashes i.e "/Back\/to\/the\/\Future/\"

Do you know why this could be happening ? It would be nice to avoid having these brackes in the middle of the movie or TV show title.

Thanks in advance.

Wed, 2015-12-30 17:05

Mariano

Offline

Joined: 9 years

Last seen: 8 years

I don't know how to work with regular expressions but I'm more than sure that this issue has to do with either one of the following lines from the ini file:

index_start.scrub {regex||data-start="(\d{10})">||}
index_title.scrub {regex||<div class="title">(.+?)</div>||}
index_description.scrub {regex||<div class="desc">(.*?)</div>||}
*
index_temp_1.scrub {regex||data-tmsid="rvp:(\d+?)"||} * id
index_urlshow.modify {set('index_temp_1' not "")|http://www.tv.com/listings/event/?EventTmsId=rvp%3A'index_temp_1'}
http://www.tv.com/listings/event/?EventTmsId=rvp%3A1952005171

index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate}* to speedup the downloading of the detail pages

title.scrub {regex||<h1>(?:<a href=.+?>)?(.+?)(?:</a>)?</h1>||}
subtitle.scrub {regex||<h2>(?:<a href=.+?>)?(.+?)(?:</a>)?</h2>||}
description.scrub {regex||<div class=\\"description\\">(.*?)</div>||}
description.modify {remove|\}
showicon.scrub {regex||data-image=\\"(http://.+?\.jpg)\\"||}
category.modify {substring(type=regex)|'title' "<label>(.+?):\s?</label>"}
title.modify {remove(type=regex)|"(<label>.+?</label>)"}

Attachments:

tv.com_.ini

Fri, 2016-01-01 17:46

1NSdbZVbpZDX

Offline

Joined: 10 years

Last seen: 7 years

works well here:

there's a difference between my ini and yours

yours: index_start.scrub {regex||data-start="(\d{10})">||}

mine: index_start.scrub {regex||data-start="(\d{10})"||}

Attachments:

WebGrab.log_.txt

guide.xml

Mon, 2016-01-04 20:16

Mariano

Offline

Joined: 9 years

Last seen: 8 years

Thanks for your comment. It did improve with a few channels, but I'm still seeing the following characters before the actual name of the movie, this only happens in movie channels such as HBO, Showtime, Starz, etc.

I've executed WG++ many times, but the outcome has always been the same: "<a href=\"/movies/transformers/\">Transformers" as the title.

I would appreciate if anyone could share their thoughts on how to fix this issue.

Thanks.

Tue, 2016-01-05 02:08

1NSdbZVbpZDX

Offline

Joined: 10 years

Last seen: 7 years

dirty fix made:

the problem is that title states 2 different arguments, one for movies another for regular shows, so regex comes in handy but im not good at that so regex Xperts do your best to fix it

in the meanwhile you can use this ini, theres a lot of (?) in titles (differs), readed somewhere in this forum a fix for that (like: title.modify preffer blah blah...)

Attachments:

tv.com_.ini

Tue, 2016-01-05 08:48

#10

francis

Offline

Joined: 12 years

Last seen: 8 months

Is the support helpful?

I've adjusted the siteini. Should be fine now.

Tue, 2016-01-05 14:42

#11

1NSdbZVbpZDX

Offline

Joined: 10 years

Last seen: 7 years

mighty Francis did it

here's another regex solution (just for study)

Attachments:

tv.com_.ini

Tue, 2016-01-05 16:19

#12

Mariano

Offline

Joined: 9 years

Last seen: 8 years

Thank you both. I will try this today and let you know the outcome.

WebGrab+Plus

Search form

You are here

site -- TV.COM not working