01-01-2024, 07:28 AM | #1 |
Junior Member
Posts: 1
Karma: 10
Join Date: Jan 2024
Device: Kindle
|
The Times and Sunday Times UK
There are several issues with the recipe for scraping this newspaper; I think the way the website works and is structured has changed:
(1) the full article is not included,
(2) random bold text saying 'Sponsored' appears,
(3) the related articles section should be removed or reformatted,
(4) the byline and date are duplicated and wrongly formatted,
(5) the byline should be separated from the article summary,
(6) the caption should be separated and distinguished in italics.
The main issue is the missing full article, which I think is because they have changed their login page from 'login.thetimes.co.uk' to 'account.thetimes.co.uk', which makes it harder to scrape. The other issues can probably be solved by updating the recipe's formatting rules, but I am not familiar with this. Has anyone made a fix for any of these problems? |
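For anyone picking this up, issues (2) and (3) above come down to dropping unwanted elements from the article HTML before conversion. In a calibre recipe this is normally done with the `remove_tags` attribute of `BasicNewsRecipe`; the snippet below is only a standalone sketch of the same idea using the Python standard library, so it can be tried outside calibre. The class names `sponsored` and `related-articles` are assumptions for illustration, not taken from the Times site, and would need to be replaced with whatever the live page source actually uses. It also assumes well-nested markup (every non-void element inside a dropped block is explicitly closed).

```python
from html import escape
from html.parser import HTMLParser

# Hypothetical class names: check the real page source before relying on these.
UNWANTED_CLASSES = {"sponsored", "related-articles"}

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class UnwantedBlockStripper(HTMLParser):
    """Re-emit HTML, dropping any element whose class list hits UNWANTED_CLASSES."""

    def __init__(self):
        super().__init__()  # convert_charrefs=True: text arrives already unescaped
        self.out = []
        self.skip_depth = 0  # greater than 0 while inside a dropped element

    @staticmethod
    def _is_unwanted(attrs):
        classes = set((dict(attrs).get("class") or "").split())
        return bool(classes & UNWANTED_CLASSES)

    def handle_starttag(self, tag, attrs):
        if self.skip_depth:
            # Already inside a dropped block: only track nesting.
            if tag not in VOID_TAGS:
                self.skip_depth += 1
            return
        if self._is_unwanted(attrs):
            if tag not in VOID_TAGS:
                self.skip_depth = 1
            return
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> open and close in one step.
        if not self.skip_depth and not self._is_unwanted(attrs):
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_depth:
            if tag not in VOID_TAGS:
                self.skip_depth -= 1
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            # Re-escape, because the parser handed us unescaped text.
            self.out.append(escape(data, quote=False))


def strip_unwanted(html_text):
    """Return html_text with the unwanted blocks removed."""
    parser = UnwantedBlockStripper()
    parser.feed(html_text)
    parser.close()  # flush any buffered trailing text
    return "".join(parser.out)
```

Calling `strip_unwanted(page_html)` on a downloaded article returns the same markup minus the matching blocks. Comments and doctype declarations are also dropped by this sketch, and it does nothing about the login change in issue (1), which needs the recipe's login step pointed at the new page.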
01-02-2024, 05:04 AM | #2 |
Evangelist
Posts: 462
Karma: 82692
Join Date: May 2021
Device: kindle
|
I can try; PM me your login details.
|