Manejando datos

Python and BeautifulSoup: extracting scores from Livescore.com

Posted by in Python

Today’s entrance is a suggestion from a reader. He want to extract the info of the score from LiveScore.com. Using a config file To increase customization, let’s create an ini file where you write key: URL as value, like this: [livescore] champ=http://www.livescore.com/soccer/champions-league/ spain=http://livescore.com/spain I have write only two URL, but you can add what you need, of course! Python’s code The basis of the code are: Read the URL from a config file (using the class ini2url) Download the HTML code Extract the score of the matches Show results Tha…read more

Python and BeautifulSoup: matches scores from Spanish League

Posted by in Python

A new chapter of web scrapping, and today my goal is to extract the results of matches played on Spanish League (first and second division). Let’s start by preparing a function that as the user to introduce a division and a “day”: The outcome is this: Division and day are the two needed data to keep on doing! I have prepared a list with the URL (they are this and this) for each division. I already explained how you can extract the HTML code with #Python, and the string that…read more

0

Python and BeautifulSoup: extracting prizes from Quiniela

Posted by in Python

Today’s entrance is the solution I wrote in order to accelerate (and reduce errors) some parts of my software about Quinielas (Quiniela is a spanish betting game, where you bet 15 matches). I have used Python with  BeautifulSoap. The goal is to scrap the prizes for the different categories, and the volumen fo money for the quiniela of the week. On previous entrances I showed you how to install it, but now, it’s time to experiement with a real problem I had. The objective The content to scrap is the…read more

0

5 big data premises

Posted by in Big Data

Under the definition Big Data, it’s implicit that we are dealing with high volume of data (or maybe not, if we’re dealing with small data). For the organizations, this aspect could become a special challenge if the data is distributed on the structure of the company. But it can be worst, for example, if the company has different data source, some of them with unstructured data. Here you have five premises of Big Data trying to answer this challange: Integrate high data volumes of transactional data and interaction. Trust your…read more