tvsou Unable to obtain data, please help write an ini

34 posts / 0 new

Last post

Tue, 2022-04-26 03:47

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

tvsou Unable to obtain data, please help write an ini

m.tvsou.com/epg/94263ee0/

<span>07:00</span><a href='//m.51livetv.com/wiki/lm_440864/' target='_blank'>栏目</a><script type='text/javascript'>judgeTime('1650927600000','//www.51livetv.com/channel/1342/','1650928800000','//m.51livetv.com/wiki/l...);</script></li><li><span>07:20</span><a href='//m.51livetv.com/wiki/zzdms/' target='_blank'>郑州大民生</a><script type='text/javascript'>judgeTime('1650928800000','//www.51livetv.com/channel/1342/','1650931200000','//m.51livetv.com/wiki/z...);</script></li><li><span>08:00</span>郑州新闻联播/直通政务<script type='text/javascript'>judgeTime('1650931200000','//www.51livetv.com/channel/1342/','1650932700000','');</script></li><li><span>08:25</span>县区政务<script type='text/javascript'>judgeTime('1650932700000','//www.51livetv.com/channel/1342/','1650935400000','');</script></li><li><span>09:10</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门31</a><a href='//m.51livetv.com/fenji/jzyzm_31.htm' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">第31集剧情</a><a href='//m.51livetv.com/yyb/jzyzm/' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">演员表</a><script type='text/javascript'>judgeTime('1650935400000','//www.51livetv.com/channel/1342/','1650937800000','//m.51livetv.com/wiki/j...);</script></li><li><span>09:50</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门32</a><a href='//m.51livetv.com/fenji/jzyzm_32.htm' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">第32集剧情</a><a href='//m.51livetv.com/yyb/jzyzm/' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">演员表</a><script

Attachments:

QQJie_Tu_20220426094855.jpg

mm.tvsou_.com_.ini

WebGrab.log_.txt

guide.xml

Tue, 2022-04-26 11:10

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

look at ur url_index.
u have a extra | at the end.
in this case it doesnt hurt anything but it shouldnt be these.

check your weebgrab log,it will show u that all shows were skipped because of missing title.
ur title scrub is bad.
sites like these can be confusing to new learners becasue in this case all the lines start wih <a href
but on this site the title is always the first one.
so just scrub it with single as u did and dont need to be fancy with the scrub as using single will always keep the first result.

ndex_title.scrub {single|<a href=|>|</a>|</a>}

Tue, 2022-04-26 12:23

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

index_title.scrub {single|||}

OK after replacement, thank you!

Attachments:

QQJie_Tu_20220426182249.jpg

Tue, 2022-04-26 12:29

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

i just noticed in your channel creation section u have.

index_site_id.modify {cleanup(removeduplicates=equal,100)}

it should be like this

index_site_id.modify {cleanup(removeduplicates link="index_site_channel")}

with what u have it will only remove duplicates in the site_id value and not the corresponding duplicate channel name.

=equal,100 is the default action so u dont need to specify it.doesnt hurt anything if you do though.

Tue, 2022-04-26 13:36

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

The specified objects of individual titles are different, and there is a lack of programs. How can I write them completely.

{{{{07:20郑州大民生judgeTime('1650928800000','//www.51livetv.com/channel/1342/','1650931200000','//m.51livetv.com/wiki/zzdms/');

08:00郑州新闻联播/直通政务judgeTime('1650931200000','//www.51livetv.com/channel/1342/','1650932700000','');

08:25县区政务judgeTime('1650932700000','//www.51livetv.com/channel/1342/','1650935400000','');

09:10电视剧:决战燕子门31}}}}}

Info ] Group (0) :
[ Info ] update requested for - 1 - out of - 1 - channels for 1 day(s)
[ Debug ]
[ Info ] ( 1/1 ) MM.TVSOU.COM -- chan. (xmltv_id=郑州时政) -- mode Force
[ Debug ] skipped show without a title at 26/04/2022 08:00:00
[ Debug ] skipped show without a title at 26/04/2022 08:25:00
[ Debug ] skipped show without a title at 26/04/2022 12:22:00
[ Debug ] skipped show without a title at 26/04/2022 19:33:00
[ Debug ] skipped show without a title at 26/04/2022 19:55:00
[ Debug ] skipped show without a title at 26/04/2022 22:00:00
[ Debug ] skipped show without a title at 26/04/2022 22:25:00
[ Debug ] skipped : last show, no next startime to use as stop
[ Info ]
[ Debug ]
[ Debug ] 26 shows in 1 channels
[ Debug ] 0 updated shows
[ Debug ] 26 new shows added
[ Info ]
[ Info ]
[ ] Job finished at 26/04/2022 11:20:52 done in 1s

家国记忆

栏目

郑州大民生

电视剧:决战燕子门31

家国记忆

栏目

郑州大民生

电视剧:决战燕子门31

Attachments:

test.txt

test2.txt

Tue, 2022-04-26 14:12

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

the mobile site is a mess.
title are in multiple different tags.
have you checked the non mobile site?
i had a quick look and things seem to use all the same tags.
i think you should try that.

Tue, 2022-04-26 14:38

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

non mobile site different labels are also used, and the reaction is strong。How to write？

https://m.tvsou.com/epg/94263ee0/w2
ndex_title.scrub {single| | < / a > | < / a > }

Wed, 2022-04-27 00:52

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

[ Info ] found: /root/.wg++/siteini.pack/China/tvsou.com.ini -- Revision 03

your not using the correct ini,revision 3 is the old one.
the new one is revision 4.
after you download the file,you have to rename them and remove the underscores,webgrab add these to all uploads for security reasons.

works fine for me..
found: /raiddata/0/NAS_WebGrab/siteini.user/China/tvsou.com.ini -- Revision 04 <====== new file revison number

update requested for - 1 - out of - 1 - channels for 1 day(s)
( 1/1 ) TVSOU.COM -- chan. (xmltv_id=河南: 郑州电视台) -- mode Force
innnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
1.67 sec/update

Wed, 2022-04-27 02:26

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

OK!OK!OK!Thanks again！Already successful
It's my carelessness.

Wed, 2022-04-27 04:32

#10

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

What statements need to be added if it is regenerated into a GZ format?

Wed, 2022-04-27 09:42

#11

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

webgrab cannot do this.
you have to comprerss the file after webgrab has completed.

Wed, 2022-04-27 09:51

#12

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

The program name is not displayed after the TV play，Incomplete.The program introduction is no longer displayed.How to modify data captured for 2 days？
https://www.tvsou.com/epg/94263ee0/
programme start="20220427003000 +0800" stop="20220427013000 +0800" channel="hnzzsz"
title lang="zh">电视剧/title
programme
programme start="20220427013000 +0800" stop="20220427030000 +0800" channel="hnzzsz"
title lang="zh"电视剧/title
programme
programme start="20220427030000 +0800" stop="20220427040000 +0800" channel="hnzzsz"
title lang="zh"电视剧/title
programme
programme start="20220427040000 +0800" stop="20220427050000 +0800" channel="hnzzsz"
title lang="zh"电视剧/title
programme
programme start="20220427050000 +0800" stop="20220427055000 +0800" channel="hnzzsz"
title lang="zh"电视剧/title>
programme
programme start="20220427055000 +0800" stop="20220427070000 +0800" channel="hnzzsz"
title lang="zh"家国记忆/title

Attachments:

WebGrab.log_.txt

guide.xml

tvsou.com_.ini

Wed, 2022-04-27 09:59

#13

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

did you do any reading before using webgrab?
http://webgrabplus.com/documentation/configuration/webgrabconfigxml#conf...

Wed, 2022-04-27 18:35

#14

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

the main site(non mobile) was having issues this morning,details page was not getting downloaded.

i made some tweaks to the ini and added did the mobile site also which seemed to work better.
both are available on the epg channels page under china or do a siteini.pack update.

Thu, 2022-04-28 07:29

#15

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

mobile site Some codes of the program appear. Can they be shielded? Add what shielding.Refer to the data I downloaded。

non mobile There is no data for individual channels, and the detailed information is still not available.

title lang="zh">聚焦双改<script type='text/javascript'>judgeTime('1651209000000','//www.51livetv.com/channel/1343/','1651209300000','');</script>

Attachments:

epg.xml

Thu, 2022-04-28 11:44

#16

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

does this look ok?
same 3 channels,regular site and mobile.
looks the same to me.

Attachments:

guide.xml

Thu, 2022-04-28 11:52

#17

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

Some channels EPG comes with some web source code,You try 94263ee0, some programs come with source code.

Thu, 2022-04-28 11:59

#18

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

think i got that fixed also..

Attachments:

guide.xml

Thu, 2022-04-28 12:18

#19

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

Attachments:

m.tvsou_.com_.ini

Thu, 2022-04-28 12:34

#20

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

Next, prepare to donate members

Thu, 2022-04-28 12:35

#21

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

that wont work because that scrub would fail for channel with data like this..

and thats what its used for.

Thu, 2022-04-28 13:26

#22

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

try these

Attachments:

tvsou.com_.ini

m.tvsou_.com_.ini

Thu, 2022-04-28 13:05

#23

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

Modified TD as script type = and unmodified 2 comparisons.TD not found on mobile terminal.

Attachments:

not_changed_epg.xml

Modified_epg.xml

Thu, 2022-04-28 13:16

#24

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

The attachment you uploaded now is completely normal, and the test passed. Thanks again!

Thu, 2022-04-28 13:28

#25

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

i made some small changes.
i decided to not separate episode number from subtitle,wg sometimes messes this up.
added channel logo for non mobile site,mobile site does not have them.

files updated above,no revision number change.

Fri, 2022-04-29 03:16

#26

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

https://epg.sports8.net/

Can this station also write an ini for standby.

Fri, 2022-04-29 03:36

#27

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

already had it done.

Attachments:

sports8.net_.ini

sports8.net_.channels.xml

Fri, 2022-04-29 04:28

#28

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

Thank you!

Fri, 2022-04-29 08:16

#29

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

https://lighttv.tvmao.com/qa/qachannelschedule?epgCode=HNTV2&op=getProgr...
Can the EPG interface grab? The channel name is https://www.tvmao.com/program/HNTV-HNTV2-w5.html
such as：FJTV2 CCTV1 BTV1

Fri, 2022-04-29 12:03

#30

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

where is the first link with the json data from?
must be from a app?
do you have the rest of the links for this also like channel,city/region,details page link?

the second url cannot be used because it only shows part of the day schedule,the rest of the day is generated in javascript code and its a encrypted string thats base64 encoded.
even if it could be figured out webgrab cannot grab 2 url's to get the full day schedule.

Fri, 2022-04-29 12:51

#31

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

Irst link with It is the interface address of an IPTV, and it is the data in the second one. Where 《epgCode=channel name》, the channel name is the program name of the second website.Live program EPG of the day。

Fri, 2022-04-29 15:56

#32

Blackbear199

Offline

Joined: 9 years

Last seen: 37 min

this uses tvmao.com
its very slow because to get the full day schedule the epg grid page needs to be used and its in 2 hour sections so 12 pages need to be grabbed to get 1 day of epg.

its title only also.
you could could create a ini to use the lighttv.tvmao.com url you posted above as the site_id="xxx" does have the correct channel ids it uses,you just need to change the channel creation section to keep only that or substring the value you need in scope=urlindex.
hint: global temp_2 already does this but its used to separate the correct channel in the showsplit and not for the url_index.

u seem to be somewhat knowledged in how ini work,it shouldnt be hard to figure this out.

Attachments:

tvmao.com_.ini

tvmao.com_.channels.xml

Mon, 2022-05-02 07:39

#33

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

I am a Chinese user ，Donation Tips:Donations to this recipient are not supported in this country or region.

Thu, 2022-10-06 15:21

#34

kongjun95848

Offline

Joined: 2 years

Last seen: 10 months

http://www.epg.huan.tv/henan/channel_index
The notice of this station is more accurate, can it be adapted to an INI

WebGrab+Plus

You are here

tvsou Unable to obtain data, please help write an ini

WebGrab+Plus

Search form

You are here

tvsou Unable to obtain data, please help write an ini