tvsou: unable to obtain data, please help me write an ini
m.tvsou.com/epg/94263ee0/
<span>07:00</span><a href='//m.51livetv.com/wiki/lm_440864/' target='_blank'>栏目</a><script type='text/javascript'>judgeTime('1650927600000','//www.51livetv.com/channel/1342/','1650928800000','//m.51livetv.com/wiki/l...);</script></li><li><span>07:20</span><a href='//m.51livetv.com/wiki/zzdms/' target='_blank'>郑州大民生</a><script type='text/javascript'>judgeTime('1650928800000','//www.51livetv.com/channel/1342/','1650931200000','//m.51livetv.com/wiki/z...);</script></li><li><span>08:00</span>郑州新闻联播/直通政务<script type='text/javascript'>judgeTime('1650931200000','//www.51livetv.com/channel/1342/','1650932700000','');</script></li><li><span>08:25</span>县区政务<script type='text/javascript'>judgeTime('1650932700000','//www.51livetv.com/channel/1342/','1650935400000','');</script></li><li><span>09:10</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门31</a><a href='//m.51livetv.com/fenji/jzyzm_31.htm' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">第31集剧情</a><a href='//m.51livetv.com/yyb/jzyzm/' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">演员表</a><script type='text/javascript'>judgeTime('1650935400000','//www.51livetv.com/channel/1342/','1650937800000','//m.51livetv.com/wiki/j...);</script></li><li><span>09:50</span><a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门32</a><a href='//m.51livetv.com/fenji/jzyzm_32.htm' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">第32集剧情</a><a href='//m.51livetv.com/yyb/jzyzm/' target='_blank' target="_blank" style="font-size: 13px;color: #d90024;margin-left: 10px;">演员表</a><script
look at your url_index.
you have an extra | at the end.
in this case it doesn't hurt anything, but it shouldn't be there.
check your webgrab log, it will show you that all shows were skipped because of a missing title.
your title scrub is bad.
sites like these can be confusing to new learners because in this case all the lines start with <a href
but on this site the title is always the first one.
so just scrub it with single as you did; there's no need to be fancy with the scrub, as using single will always keep the first result.
index_title.scrub {single|<a href=|>|</a>|</a>}
index_title.scrub {single|||}
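To illustrate why keeping only the first match is enough on this page, here is a minimal Python sketch. It is not WebGrab+Plus's own scrub engine, and `first_link_text` is a hypothetical helper written for this example. It takes one schedule row and returns the text of the first `<a href=...>` link, ignoring the later "episode plot" and "cast" links. Rows without any link return nothing, which is consistent with a show being skipped for a missing title.

```python
import re
from typing import Optional

def first_link_text(li_html: str) -> Optional[str]:
    """Return the text of the first <a href=...>...</a> in a row, or None."""
    match = re.search(r"<a href=[^>]*>(.*?)</a>", li_html, flags=re.S)
    return match.group(1) if match else None

# Row taken from the page source quoted earlier in the thread (attributes trimmed).
row_with_links = (
    "<span>09:10</span>"
    "<a href='//m.51livetv.com/wiki/jzyzm/' target='_blank'>电视剧:决战燕子门31</a>"
    "<a href='//m.51livetv.com/fenji/jzyzm_31.htm' target='_blank'>第31集剧情</a>"
    "<a href='//m.51livetv.com/yyb/jzyzm/' target='_blank'>演员表</a>"
)
# Row where the title is plain text, not wrapped in a link.
row_without_link = "<span>08:00</span>郑州新闻联播/直通政务"

print(first_link_text(row_with_links))
print(first_link_text(row_without_link))
```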
It works after the replacement, thank you!
i just noticed in your channel creation section you have:
index_site_id.modify {cleanup(removeduplicates=equal,100)}
it should be like this:
index_site_id.modify {cleanup(removeduplicates link="index_site_channel")}
with what you have, it will only remove duplicates in the site_id value and not the corresponding duplicate channel name.
=equal,100 is the default action so you don't need to specify it. it doesn't hurt anything if you do, though.
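The difference between the two forms can be sketched in Python. This only illustrates the idea of a linked de-duplication and is not how WebGrab+Plus implements it; the function name and the sample values are hypothetical. When a duplicate site_id is dropped, the channel name at the same position is dropped with it, so the two lists stay aligned.

```python
def dedupe_linked(site_ids, channel_names):
    """Drop repeated site_ids together with the channel name at the same index."""
    seen = set()
    kept_ids, kept_names = [], []
    for site_id, name in zip(site_ids, channel_names):
        if site_id in seen:
            continue  # duplicate id: skip it and its linked channel name
        seen.add(site_id)
        kept_ids.append(site_id)
        kept_names.append(name)
    return kept_ids, kept_names

# Hypothetical channel list with one repeated entry.
ids = ["94263ee0", "94263ee0", "aaaa0001"]
names = ["郑州时政", "郑州时政", "Example Channel"]

linked_ids, linked_names = dedupe_linked(ids, names)
print(linked_ids, linked_names)

# De-duplicating only the ids leaves the name list one entry longer,
# so ids and names no longer line up.
ids_only = list(dict.fromkeys(ids))
print(len(ids_only), len(names))
```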
Individual titles sit in different tags, so some programs are missing. How can I write the scrub so that it captures all of them?
{{{{07:20郑州大民生judgeTime('1650928800000','//www.51livetv.com/channel/1342/','1650931200000','//m.51livetv.com/wiki/zzdms/');
[ Info ] Group (0) :
[ Info ] update requested for - 1 - out of - 1 - channels for 1 day(s)
[ Debug ]
[ Info ] ( 1/1 ) MM.TVSOU.COM -- chan. (xmltv_id=郑州时政) -- mode Force
[ Debug ] skipped show without a title at 26/04/2022 08:00:00
[ Debug ] skipped show without a title at 26/04/2022 08:25:00
[ Debug ] skipped show without a title at 26/04/2022 12:22:00
[ Debug ] skipped show without a title at 26/04/2022 19:33:00
[ Debug ] skipped show without a title at 26/04/2022 19:55:00
[ Debug ] skipped show without a title at 26/04/2022 22:00:00
[ Debug ] skipped show without a title at 26/04/2022 22:25:00
[ Debug ] skipped : last show, no next startime to use as stop
[ Info ]
[ Debug ]
[ Debug ] 26 shows in 1 channels
[ Debug ] 0 updated shows
[ Debug ] 26 new shows added
[ Info ]
[ Info ]
[ ] Job finished at 26/04/2022 11:20:52 done in 1s
家国记忆
栏目
郑州大民生
电视剧:决战燕子门31
the mobile site is a mess.
titles are in multiple different tags.
have you checked the non mobile site?
i had a quick look and things seem to use all the same tags.
i think you should try that.
The non-mobile site also uses different tags, and the reaction is strong. How should I write it?
https://m.tvsou.com/epg/94263ee0/w2
index_title.scrub {single| |</a>|</a>}
[ Info ] found: /root/.wg++/siteini.pack/China/tvsou.com.ini -- Revision 03
you're not using the correct ini, revision 3 is the old one.
the new one is revision 4.
after you download the files, you have to rename them and remove the underscores. webgrab adds these to all uploads for security reasons.
works fine for me..
found: /raiddata/0/NAS_WebGrab/siteini.user/China/tvsou.com.ini -- Revision 04 <====== new file revision number
update requested for - 1 - out of - 1 - channels for 1 day(s)
( 1/1 ) TVSOU.COM -- chan. (xmltv_id=河南: 郑州电视台) -- mode Force
innnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
1.67 sec/update
OK! OK! OK! Thanks again! It works now.
It was my carelessness.
What settings need to be added so that the output is generated in GZ format?
webgrab cannot do this.
you have to compress the file after webgrab has completed.
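One way to do that compression step is a small script run after the grab finishes. This is a minimal sketch using only the Python standard library; `gzip_file` is a helper written for this example, and the `guide.xml` file name in the usage comment is an assumption, so substitute the output file set in your own config.

```python
import gzip
import shutil
from pathlib import Path

def gzip_file(src: str) -> Path:
    """Write a gzip-compressed copy of src next to it and return the new path."""
    src_path = Path(src)
    dst_path = src_path.with_name(src_path.name + ".gz")
    with open(src_path, "rb") as fin, gzip.open(dst_path, "wb") as fout:
        shutil.copyfileobj(fin, fout)  # stream the data, no need to load it all
    return dst_path

# Usage (assumed file name), to be run once the grab has finished:
# gzip_file("guide.xml")   # creates guide.xml.gz beside the original
```

The original file is left in place, so anything that still reads the uncompressed guide keeps working.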
The program name after "电视剧" (TV drama) is not displayed, so the titles are incomplete. The program description is no longer displayed either. Also, how do I change the settings to grab 2 days of data?
https://www.tvsou.com/epg/94263ee0/
<programme start="20220427003000 +0800" stop="20220427013000 +0800" channel="hnzzsz">
<title lang="zh">电视剧</title>
</programme>
<programme start="20220427013000 +0800" stop="20220427030000 +0800" channel="hnzzsz">
<title lang="zh">电视剧</title>
</programme>
<programme start="20220427030000 +0800" stop="20220427040000 +0800" channel="hnzzsz">
<title lang="zh">电视剧</title>
</programme>
<programme start="20220427040000 +0800" stop="20220427050000 +0800" channel="hnzzsz">
<title lang="zh">电视剧</title>
</programme>
<programme start="20220427050000 +0800" stop="20220427055000 +0800" channel="hnzzsz">
<title lang="zh">电视剧</title>
</programme>
<programme start="20220427055000 +0800" stop="20220427070000 +0800" channel="hnzzsz">
<title lang="zh">家国记忆</title>
did you do any reading before using webgrab?
http://webgrabplus.com/documentation/configuration/webgrabconfigxml#conf...
the main site (non mobile) was having issues this morning, the details page was not getting downloaded.
i made some tweaks to the ini and also did the mobile site, which seemed to work better.
both are available on the epg channels page under china, or do a siteini.pack update.
Mobile site: some page source code appears in the program titles. Can it be filtered out, and what needs to be added to do that? Please refer to the data I downloaded.
Non-mobile site: some individual channels have no data, and the details are still not available.
<title lang="zh">聚焦双改<script type='text/javascript'>judgeTime('1651209000000','//www.51livetv.com/channel/1343/','1651209300000','');</script>
does this look ok?
same 3 channels,regular site and mobile.
looks the same to me.
The EPG for some channels comes with page source code in it. Try 94263ee0, some of its programs come with source code.
think i got that fixed also..
I changed
</span>||</td>|</td>
to
</span>||<script type=|>|<script type=|>
and the garbled code is gone, the output is normal now. I uploaded an attachment, does it look right to you?
Next, I am going to donate to become a member.
that won't work because that scrub would fail for channels with data like this..
<li>
<span>18:30</span>
省新闻<td></td>
</li>
and that's what it's used for.
try these
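The two cases discussed here, a title followed by a judgeTime script and a title followed by an empty td, can be checked side by side with a small Python sketch. This is only an illustration of the cleanup the title needs, not the scrub used in the ini, and `clean_title` is a hypothetical helper.

```python
import re

# Whole <script ...>...</script> blocks, including the judgeTime call inside.
SCRIPT_RE = re.compile(r"<script\b.*?</script>", re.S | re.I)
# Any remaining single tag such as <td> or </td>.
TAG_RE = re.compile(r"<[^>]+>")

def clean_title(raw: str) -> str:
    """Remove script blocks first, then leftover tags, then surrounding whitespace."""
    without_script = SCRIPT_RE.sub("", raw)
    return TAG_RE.sub("", without_script).strip()

# Title with script residue, as quoted from the mobile site output.
with_script = (
    "聚焦双改<script type='text/javascript'>judgeTime('1651209000000',"
    "'//www.51livetv.com/channel/1343/','1651209300000','');</script>"
)
# Title followed by an empty table cell, as in the <li> example above.
with_td = "\n省新闻<td></td>\n"

print(clean_title(with_script))
print(clean_title(with_td))
```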
I compared two versions, one with td changed to script type= and one unmodified. No td was found on the mobile site.
The attachment you uploaded now is completely normal, and the test passed. Thanks again!
i made some small changes.
i decided not to separate the episode number from the subtitle, wg sometimes messes this up.
added the channel logo for the non mobile site, the mobile site does not have them.
files updated above, no revision number change.
https://epg.sports8.net/
Could you also write an ini for this site, as a backup?
already had it done.
Thank you!
https://lighttv.tvmao.com/qa/qachannelschedule?epgCode=HNTV2&op=getProgr...
Can this EPG interface be grabbed? The channel names come from https://www.tvmao.com/program/HNTV-HNTV2-w5.html
such as: FJTV2, CCTV1, BTV1
where is the first link with the json data from?
must be from an app?
do you have the rest of the links for this also, like channel, city/region, details page link?
the second url cannot be used because it only shows part of the day's schedule. the rest of the day is generated in javascript code, and it's an encrypted string that's base64 encoded.
even if it could be figured out, webgrab cannot grab 2 urls to get the full day schedule.
The first link is the interface address of an IPTV service, and it serves the same data as the second one. In epgCode=<channel name>, the channel name is the channel code used on the second website. It returns the live program EPG for the current day.
this uses tvmao.com
it's very slow because to get the full day schedule the epg grid page needs to be used, and it's in 2 hour sections, so 12 pages need to be grabbed to get 1 day of epg.
it's title only also.
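The page count follows from 24 hours divided into 2 hour sections. As a small Python sketch of that arithmetic (the function name is hypothetical and nothing here reflects how the site builds its urls), this lists the windows that would each need their own request:

```python
from datetime import date, datetime, timedelta

def two_hour_windows(day: date):
    """Return the (start, end) pairs of the 2 hour sections covering one day."""
    midnight = datetime(day.year, day.month, day.day)
    return [
        (midnight + timedelta(hours=h), midnight + timedelta(hours=h + 2))
        for h in range(0, 24, 2)
    ]

windows = two_hour_windows(date(2022, 4, 26))
print(len(windows))  # one page request per window
```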
you could create an ini to use the lighttv.tvmao.com url you posted above, as the site_id="xxx" does have the correct channel ids it uses. you just need to change the channel creation section to keep only that, or substring the value you need in scope=urlindex.
hint: global temp_2 already does this, but it's used to separate the correct channel in the showsplit and not for the url_index.
you seem to be somewhat knowledgeable in how inis work, it shouldn't be hard to figure this out.
I am a user in China. When I try to donate I get this message: "Donations to this recipient are not supported in this country or region."
http://www.epg.huan.tv/henan/channel_index
The schedule on this site is more accurate. Can an ini be written for it?