Hello,
As I was creating my first SiteINI file, I stumbled upon some optional strings found in different elements such as Title.
For exmple I came across a site with these lines as title:
<h3><a href="http://www.webgrabplus.com/en/articles/Superman-Returns.html">Superman Returns</a></h3> <h3>The Insider</h3> <h3>Entertainment Tonight </h3> <h3><a href="http://www.webgrabplus.com/en/articles/Riding-in-Cars-with-Boys.html">Riding in Cars with Boys</a></h3>
As it is seen, some titles have this string: <a href="http://www.webgrabplus.com/%28.%2B%29"> before title and this </a> after title. How can I program WebGrabPlus to not include these optional strings in my element (title in this case)?
PS: I know a little about RegEx if it may help. In RegEx I can use Capture Group function like this:
Find:
<h3>(<a href="http://www.webgrabplus.com/%28.%2B%29">)?(.+)( <\a>)?</h3>
Replace:
<h3>\3</h3>
But I don't know how to do it with WebGrab++
Thanks so much. I will try both methods. Really appreciate it!
By the way, does XMLTVs support Arabic language? I could extract Arabic information (in which words places were messed up) but KODI wouldn't show Arabic at all. Is there an app to read Arabic XMLTV? Or should I change something in the XMLTV grabbing process?