Reading:
Navigating the complexity of change aversion
Image
Big Data, AI and Science

AIlon Story

From Social Media data to augmented intelligence

From Science to Business

One of our founders was looking for a topic for his masterthesis in Data Science in 2016. Right before the media buzz around psychological targeting and Cambridge Analytica, he read about "Evidence Lifts from Facebook Likes" in the book Data Science for Business, where the authors described how to calculate the IQ of an individual based on what pages an individual follows on Facebook. We can strongly recommend this book if you want to get into Data Science / Machine Learning.

Data Science for Business

The authors state that it is also possible to calculate various socio-demographic attributes based on what people follow on Facebook. At this point it was clear that these methods were absolutely underrated - as nearly all of us have a digital footprint and there was absolutely no reason to assume the results to be limited to the aforementioned domains - the opposite was the case. As great as the actual idea was - the proposed Machine Learning algorithms actually implied wrong assumptions about the data.

Further research into the topic brought to light that the entire topic originated with a study called Computer-based personality judgments are more accurate than those made by humans where researchers proved algorithms to assess personality of an individual as well as its spouse (see the following figure). The figure shows that the more pages an individual follows on Facebook, the more accurate the algorithm assess the individual's personality (the accuracy of a working colleague can be reached given ~ 10 followings, the one of the spouse ~270 followings):

Image
Computer-based personality judgments are more accurate than those made by humans
Wu Youyou, Michal Kosinski and David Stillwell

These results are astonishing as it is typically believed that accurate personality perceptions stem from social-cognitive skills of the human brain. Thus, this research had a quite massive media echo:


The methodological lack persisted

Despite the media echo, no further research regarding the method has been provided. In course of his master thesis, one of our founders proposed a refined approach to increase prediction accuracy using Elastic Net Feature regression and Support Vector Machine Regression with a non-liner kernel function to benefit from the interaction terms between Facebook followings with extraordinary results.

Image

higher accuracy

of the data

~100% higher accuracy through elaborated algorithms using just a tenth of the data

The figure above visualizes the results, which look very similar to the ones we have been seen before. Just look at the y-axis: the new method "starts" with an accuracy of 58% (corresponding to the aforementioned accuracy of the spouse), given just ten Facebook followings.

There have been five major learnings:

  1. It is necessary to handle this topic with highest ethical standards and strict data privacy ruling.
  2. There is much more information in the data as thought before
  3. It is not necessary to build the "data leeche". Build smart data systems instead of big data systems.
  4. Results are still significantly improvable by using Deep Learning.
  5. The results are not limited to the domain of Facebook likes.
If we can predict something as abstract as personality, there must be so much more we can predict

Understand how it works

Ready for a demo?

Contact us to get into augmented intelligence - better today than too late.

Please type your name.