1. Online News Exhibition

Following the changes in working habits during the COVID pandemic, your local library has decided it is time to make the newspaper archives available online. They are not sure what the response will be, so they are going to trial a new system on a small set of their data.

The library database has a table, articles, with a record of every article published in the newspaper. However, the table will need some preparation before the library can use it in the new system. They would like to focus the initial trial on articles from 2014 and 2015 only, and on just one journalist, whose ID in the database is 1754.

The final results should be sorted from earliest published date to the most recent published date.
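
The filter and sort described above can be sketched in a few lines. This is only an illustration under stated assumptions: the field names `article_id` and `published` and the sample rows are hypothetical, and the journalist is matched on the first four digits of the article ID, as the requirements describe.

```python
from datetime import date

# Hypothetical sample rows; the field names are assumptions, not the library's schema.
rows = [
    {"article_id": "17540012", "published": date(2015, 3, 2)},
    {"article_id": "17540007", "published": date(2014, 6, 30)},
    {"article_id": "17540020", "published": date(2016, 1, 5)},
    {"article_id": "20310004", "published": date(2014, 9, 9)},
]


def select_trial_rows(rows, journalist_id="1754", years=(2014, 2015)):
    """Keep one journalist's articles from the given years, oldest first."""
    kept = [
        r for r in rows
        # The first four digits of the article ID identify the journalist.
        if str(r["article_id"]).startswith(journalist_id)
        and r["published"].year in years
    ]
    # Sort on the real date value, not on a formatted text version of it.
    return sorted(kept, key=lambda r: r["published"])


trial = select_trial_rows(rows)
```

Sorting on the underlying date matters here: once the date has been turned into text such as 'Aug 13, 1995', ordering by that text would be alphabetical rather than chronological.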

Below are the requirements the library has given to you for the data they need.

| Requirements | Data Type | Column Alias |
| --- | --- | --- |
| Unique ID of each article, where the first 4 digits represent the journalist ID. | TEXT | article_key |
| Category of the article, in uppercase. 'Food & Drink' and 'Wellness' should be collapsed into a 'Lifestyle' category. | TEXT | category |
| Headline of the article, without the subtitle that appears after the colon (:). For example, 'It Could Happen To You: A Story of SEC Overreach' becomes 'It Could Happen To You'. | TEXT | headline |
| Subtitle of the article, without the headline that appears before the colon (:). For example, 'It Could Happen To You: A Story of SEC Overreach' becomes 'A Story of SEC Overreach'. Where a headline does not have a subtitle, the missing value should read 'None'. | TEXT | subtitle |
| Description of the article. The description should be reduced to the first sentence of the short_description column, up to and including the first period (.). | TEXT | description |
| Keywords of the article. Multiple keywords will be separated with a dash (-). If the keyword is missing, the value should read 'Unknown'. | TEXT | keywords |
| Date the article was published. The date should be displayed in a format resembling the following: 'Aug 13, 1995'. | TEXT | publish_date |
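
As a way to check your understanding of the column rules above, here is a minimal sketch of each transformation as a small function. It is not the library's implementation. All function names are hypothetical, and three points the requirements leave open are treated as assumptions: the collapsed category is written in uppercase as 'LIFESTYLE', the day of the month is zero-padded, and existing keyword text is left untouched because the raw separator is not specified.

```python
from datetime import date

MONTHS = ("Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec")
LIFESTYLE = {"FOOD & DRINK", "WELLNESS"}


def clean_category(raw):
    """Uppercase the category and collapse the two lifestyle categories."""
    upper = raw.strip().upper()
    # Assumption: the merged category is also reported in uppercase.
    return "LIFESTYLE" if upper in LIFESTYLE else upper


def split_headline(raw):
    """Return (headline, subtitle), splitting at the first colon."""
    head, sep, sub = raw.partition(":")
    subtitle = sub.strip() if sep and sub.strip() else "None"
    return head.strip(), subtitle


def first_sentence(text):
    """Keep the text up to and including the first period."""
    idx = text.find(".")
    return text if idx == -1 else text[: idx + 1]


def clean_keywords(raw):
    """Replace a missing or empty keyword value with 'Unknown'."""
    # Assumption: existing keyword text is passed through unchanged.
    return raw if raw else "Unknown"


def format_date(d):
    """Format a date like 'Aug 13, 1995' (zero-padded day is an assumption)."""
    return f"{MONTHS[d.month - 1]} {d.day:02d}, {d.year}"


headline, subtitle = split_headline(
    "It Could Happen To You: A Story of SEC Overreach"
)
```

Splitting at the first colon only means that a subtitle containing its own colon stays in one piece. A description with no period at all is returned unchanged, which is one reasonable reading of the rule rather than something the requirements state.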