Week 8 – Pre Mortem

--Originally published at TC3045 – Sagnelli's blog

Howdy partners,

It’s me again with another update on what we will be working this week. Last week, I worked on mining all tweets from an account on Twitter. This week, I will be polishing what I did last week, and I will be working with Alfonso on eliminating the stop words of a tweet. Stop words are empty words that are filtered out from natural language processing, such as: the, is, at, which, and on.

By doing this, we will be able to create a more accurate words map without having a lot of concurrences of stop words and focusing just on the information that matters.

Stay tuned for the advancement of this week on the post mortem blog post.

Week 3: PyMySQL is about to be applied

--Originally published at TC3045 – Sagnelli's blog

“When you want to succeed as bad as you want to breathe, then you’ll be successful.”

– Eric Thomas

Hey, I’m still awake. Now, there’s a reason to it, and as bad as I want that reason to be watching a Netflix series, or playing videogames, it is not. I am thinking on what I will be doing this week for the Elections Analyzer 2018 project.

This week is going to be all about creating generic functions in Python for DML queries on a MySQL database. I will be dividing this functions as micro-services. Henceforth, there is going to be a separate micro-service for inserting (creating), selecting (reading), updating, and deleting, (CRUD), from a generic database. Everything using the PyMySQL library.

Stay tuned for the post-mortem update of this week’s development.

Week 2: The beginning

--Originally published at TC3045 – Sagnelli's blog

As the title states, this week is where it all begins. We already know the objective, and how we are going to achieve it, so let’s dive into work.

Resultado de imagen para gif working

If you want to see our progress, you can by visiting our repo. This week is going to be about getting up and running with the database, and start structuring the python project.

Summary:

  • Define microservices and divide functionalities.
  • Structure python project.
  • Getting up and running with the database.
  • Start developing with unit testing, and Agile methodology.

So, let’s begin…

Election year: let’s be genuine, shall we?

--Originally published at TC3045 – Sagnelli's blog

2018 is an important year for Mexico, where the next six years are supposed to be defined by the Mexican people; however, corruption has always interfered with democracy, as the government has been accused of manipulating the votes.

Resultado de imagen para simpsons vote gif

This is the problem we are trying to solve with our project. Now, the important question is:

Who are we?

We are a group of students between 6th and 8th semester of Computer Science at Tec de Monterrey Campus Guadalajara:

  • Alfonso Contreras
  • Arturo González
  • Alejandro Vázquez
  • Michelle Sagnelli

What is our solution?

Basically, in one sentence, we are building a series of microservices that will let us determine who is the best acclaimed, and the most popular presidential candidate according to Twitter.

How are we supposed to do it?

We will apply data mining using Python Streaming Jobs, and Twitter’s API to temporarily store tweets in JSON’s. Afterwards, this data will be shown and saved for later use.

The challenge is to clean data by mining keywords, eliminating stopwords, and assigning tokens by tweet importance. Henceforth, this “clean” data will be used to analyze with machine learning the importance of this year’s candidates, and political parties. Finally, this information will be stored in JSON format for further analysis of political parties information, and candidates’ level of acceptance.

Extras

We are trying to implement location-based analysis of tweets, and being able to find which tweets belong to bots for achieving a more successful analysis.

This should be fun. I am very interested in this project, as it is challenging, and interesting. If you are interested too, do not hesitate in contacting me, and stay tuned with mine, and my colleagues’ , future blog posts.

Regards,

Mike.

TC3045 Course expectations

--Originally published at TC3045 – Sagnelli's blog

We meet again Ken. It is blogging time about TC3045 course. This course is about quality and software testing. What I expect from this course is to improve my coding skills by learning how to use testing as an advantage, being able to write code that every programmer can understand, and learning how to use testing to avoid future bugs or errors.

I hope that this course is based on projects, as practice is the best way of improving skills and abilities. I also think that exams are not the best proof that students learned something because it is not a real situation.

I’ve already worked with Ken, and I guess that this course is going to be great with his methodologies, and his expertise.

“Life is about growing, and improving, and getting better.”

– Conor McGregor