Hello, is it possible to capture the urls of the vertical and horizontal images?
https://www.movistarplus.es/deportes/futbol-espanyol/resumenes-laliga-hy...
-VERTICAL:
https://www.movistarplus.es/detalle?xhr=1&tipo=n&id=F4151988&p=1
https://www.movistarplus.es/recorte/n/detallegaleriav/F4151988
-HORIZONTAL:
https://www.movistarplus.es/detalle?xhr=1&tipo=n&id=F4151989&p=1
https://www.movistarplus.es/recorte/n/detallegaleriah/F4151989
Thank you so much
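For reference, the two image links above differ only in the path segment (detallegaleriav vs detallegaleriah) and the id. A minimal Python sketch of that pattern follows; it assumes every programme follows the same pattern as these two examples, which the thread does not confirm, and the function name gallery_urls is made up for illustration.

```python
# Sketch: rebuild the two gallery image URLs quoted above from their ids.
# The path segments are copied from the example links; that all programmes
# use the same pattern is an assumption.
BASE = "https://www.movistarplus.es/recorte/n"


def gallery_urls(vertical_id, horizontal_id):
    """Return the vertical and horizontal image URLs for the given ids."""
    return {
        "vertical": f"{BASE}/detallegaleriav/{vertical_id}",
        "horizontal": f"{BASE}/detallegaleriah/{horizontal_id}",
    }


urls = gallery_urls("F4151988", "F4151989")
print(urls["vertical"])
print(urls["horizontal"])
```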
no idea where you're getting those links from.
in short..
if the image must be...
on the index page (the page that lists all the programs for the day)
or
on the details page for each program
then the answer is no.
also i have never seen an epg viewer that will display more than one show icon.
I create two xml's, one with vertical images and one with horizontal images, to use in different applications.
https://www.movistarplus.es/deportes/futbol-espanyol/copa-del-rey-23-24/...
I would like to capture this URL:
"https://www.movistarplus.es/recorte/n/galeria/F4148029"
well..
yes these images are on the show details page but..
the site isn't consistent about the portrait image being first and the landscape second.
this may be true for the few sports events i checked, but for other shows like series or movies the first image may be a screenshot of a scene (there are more than just 2 images in the gallery).
there's nothing in the data that specifically says whether an image is a portrait, landscape, scene shot, etc.
For me that is not important, I just need a way to capture that url, then with an "if" that detects "Deportes" I will organize the urls of the images.
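The post-processing described here could look roughly like the Python sketch below. It assumes, as noted above for the few sports events checked, that the first gallery image is the portrait and the second the landscape. The function name, the category string and the example URLs are hypothetical and are not taken from the ini.

```python
def organise_images(category, image_urls):
    """Return (portrait, landscape) for sports events, else (None, None).

    Assumption: for "Deportes" the first gallery image is the portrait
    and the second is the landscape. This is not guaranteed by the site.
    """
    if "Deportes" in category and len(image_urls) >= 2:
        return image_urls[0], image_urls[1]
    return None, None


# Hypothetical gallery URLs, for illustration only.
gallery = ["https://example.invalid/img/1", "https://example.invalid/img/2"]
portrait, landscape = organise_images("Deportes", gallery)
print(portrait, landscape)
```

For any other category the sketch returns nothing rather than guessing, since the image order is not reliable there.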
this should get what you want.
read the revision change to see what i did.
Great, that's perfect for me, thank you very much.
Another thing, Movistar is duplicating events and the capture is failing.
the ini already has code to deal with duplicates.
looks fine to me, debug shows it's removed..
btw it looks like you didn't change the timezone setting...
* @Remarks: use timezone of pc running webgrab for timezone setting
its currently set to my timezone.
I'm trying your modification for the "Deportes" images and it doesn't work for me. I'm using the latest version of webgrab, V5.1.4.3, and it's as if the "if" doesn't work.
Now I keep getting
[Error downloading page: Response status code does not indicate success: 400 (Bad Request).
What trouble I'm having ;-)
Trying with your ini directly I also have the same problem.
just tried it and still works fine for me.
did you change anything?
I have not changed anything, I have used the last file you have posted above as is.
the same file is uploaded to the siteini.pack
try that.
check your channel lines for a typo, especially the site_id="xx" value.
bad request usually means the request url isn't correct.
you could also try adding debug to the url_index line like below..
url_index {url(debug)|http://xxxxxx}
run webgrab and check your log file.
it will show the full request http address, paste it in a browser and you should see data.
Trying with the sitepack ini with same result
the urls created look fine, so something else is going on with your linux (server i assume).
is it (your ip) being blocked possibly?
can you try from a different pc?
The most curious thing is that with an old version of webgrab it works perfectly.
i'm using 5.1.4.3 and it's fine for me on windows/linux
did you use the installer when you installed wg on linux or are you running wg manually using custom paths?
it looks like your wg config, etc. is in this directory?
/home/ubuntu/EPG/zzz
if so how are you running webgrab, custom script?
Yes, I am using a script with these two lines:
export OPENSSL_CONF=/home/ubuntu/EPG/zzz/bin.net/openssl.cnf
/home/ubuntu/.dotnet/dotnet /home/ubuntu/EPG/zzz/bin.net/WebGrab+Plus.dll /home/ubuntu/EPG/zzz
I use a VPS with Ubuntu, I have not changed any of the configuration, I also download from other sources such as Orange.es and Vodafone.pt, all of them work correctly.
Currently I am getting 400 (Bad Request) failures with Movistar.es and Tivify.tv.
I don't know where the problem could be
check the useragent in your webgrab config that dotnet uses.
is it the same as V3.x uses with mono?
Yes it is the same
Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36
on your vps see if curl returns data.
curl https://www.movistarplus.es/programacion-tv/2024-03-29?v=json
ubuntu@ubuntu20-04:~/EPG/zzz$ curl https://www.movistarplus.es/programacion-tv/2024-03-29?v=json > 29.txt
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 1816k  100 1816k    0     0  2865k      0 --:--:-- --:--:-- --:--:-- 2860k
ubuntu@ubuntu20-04:~/EPG/zzz$ curl https://www.movistarplus.es/programacion-tv/2024-03-30?v=json > 30.txt
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 1871k  100 1871k    0     0  3565k      0 --:--:-- --:--:-- --:--:-- 3565k
ubuntu@ubuntu20-04:~/EPG/zzz$ curl https://www.movistarplus.es/programacion-tv/2024-03-31?v=json > 31.txt
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 1877k  100 1877k    0     0  3952k      0 --:--:-- --:--:-- --:--:-- 3952k
ubuntu@ubuntu20-04:~/EPG/zzz$ curl https://www.movistarplus.es/programacion-tv/2024-04-01?v=json > 1.txt
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 1883k  100 1883k    0     0  3642k      0 --:--:-- --:--:-- --:--:-- 3642k
ubuntu@ubuntu20-04:~/EPG/zzz$ curl https://www.movistarplus.es/programacion-tv/2024-04-02?v=json > 2.txt
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 1825k  100 1825k    0     0  4021k      0 --:--:-- --:--:-- --:--:-- 4021k
that looks ok.
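The per-day index URLs used in the curl checks above follow a simple date pattern, so they can be generated rather than typed by hand. A small Python sketch, assuming the programacion-tv/YYYY-MM-DD?v=json form seen in this thread stays valid (index_urls is a made-up helper name):

```python
from datetime import date, timedelta

INDEX = "https://www.movistarplus.es/programacion-tv/{day}?v=json"


def index_urls(start, days):
    """Build one index URL per day, starting at `start`."""
    return [
        INDEX.format(day=(start + timedelta(days=offset)).isoformat())
        for offset in range(days)
    ]


# The same five days that were fetched with curl above.
for url in index_urls(date(2024, 3, 29), 5):
    print(url)
```

Each printed URL can then be passed to curl, or pasted into a browser, to confirm the server returns data.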
try adding this to one of the site {xxx} lines in the movistarplus.es.ini
grabengine=legacy
on the site {xxx} line, settings are separated using a |
you should get the idea by looking at what's there already.
this will force webgrab to use the same request type as V3.x uses
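To illustrate the | separator, here is a Python sketch that appends grabengine=legacy to a site {…} line if it is not already there. The example site line is hypothetical and is not copied from the real movistarplus.es.ini; only the grabengine=legacy setting and the | separator come from this thread.

```python
def add_site_setting(site_line, setting):
    """Append `setting` inside the braces of a site {...} line, using | as separator."""
    head, brace, tail = site_line.rpartition("}")
    if not brace or "{" not in head:
        raise ValueError("not a site {...} line")
    existing = head[head.index("{") + 1:].split("|")
    if setting in existing:
        return site_line  # already present, leave the line unchanged
    return f"{head}|{setting}}}{tail}"


# Hypothetical site line, for illustration only.
line = "site {url=movistarplus.es|timezone=Europe/Madrid|maxdays=7}"
patched = add_site_setting(line, "grabengine=legacy")
print(patched)
```

In practice it is a one-word edit in a text editor; the sketch only shows where the setting goes on the line.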
With grabengine=legacy works well
THANK YOU
Does it affect the capture performance?
shouldn't
the question is why does it work for me without that setting.
i think V5.x uses http client by default.
i forget what V3.x uses, only jan could answer this.
Ok, let's see if Jan can check this, because I notice that now the capture is slower.
that could be why V5.x uses a different request format, it's faster.
i sent him a message to look at this post.
OK, perfect, thanks a lot
V5 grabengine is based on HttpClient class by default. Setting legacy uses the old one based on the obsolete HttpWebRequest class (no longer supported by Microsoft) .. No idea why you have a problem with it .. make sure you have latest dotnet 7 runtime ..
The if/else issue .. The problem was mainly caused by a bug in the code that evaluated the if and else condition twice .. plus a few smaller issues. I fixed it .. but I need to look at loop because that might have a similar issue .. to be continued
It's all very strange, because it only happens with Movistar.es.ini and Tivify.tv.ini
Could the time change that we will have on Sunday in Spain be affecting it?