ABSTRACT: When we think about web news as a source of information it seems that there's nothing new to talk about. Google News et similia are out there for years and everybody can access that huge amount of information, mostly for free. However, if you want to extract real value from news articles, things are getting much more complicated. At SpazioDati we are focused on collecting as much information as possible about all Italian companies from many different sources and news is one of the richest, but at the same time hardest, kinds of sources we are dealing with. So we built Sedano, our news processing pipeline that is able to ingest, clean, deduplicate, annotate, classify and cluster several thousands of news articles per day and make them available to our users. We will talk about the challenges we faced, the solutions we implemented, and the open issues we are currently working on. BIO: Ugo Scaiella is Software Engineering Manager at SpazioDati where he leads a team of more than 30 highly skilled and talented engineers and data scientists. Previously, he spent several years developing and playing with Machine Learning and Information Retrieval systems both in industry and academic environments. When not dealing with crazy deadlines and an insane amount of projects simultaneously, you might find him grilling a wagyu steak or waiting for the pork ribs to reach 98°C inside.