From Science to Business
One of our founders was looking for a topic for his masterthesis in Data Science in 2016. Right before the media buzz around psychological targeting and Cambridge Analytica, he read about Evidence Lifts from Facebook Likes in the book Data Science for Business, where the authors described how to calculate the IQ of an individual based on what pages an individual follows on Facebook. We can strongly recommend this book if you want to get into Data Science / Machine Learning.
The authors state that it is also possible to calculate various socio-demographic attributes based on what people follow on Facebook. At this point it was clear that these methods were absolutely underrated - as nearly all of us have a digital footprint and there was absolutely no reason to assume the results to be limited to the aforementioned domains - the opposite was the case. As great as the actual idea was - the proposed Machine Learning algorithms actually implied wrong assumptions about the data.
Further research into the topic brought to light that the entire topic originated with a study called Computer-based personality judgments are more accurate than those made by humans where researchers proved algorithms to assess personality of an individual as well as its spouse (see the following figure). The figure shows that the more pages an individual follows on Facebook, the more accurate the algorithm assess the individual's personality (the accuracy of a working colleague can be reached given ~ 10 followings, the one of the spouse ~270 followings):
These results are astonishing as it is typically believed that accurate personality perceptions stem from social-cognitive skills of the human brain. Thus, this research had a quite massive media echo:
The methodological lack persisted
Despite the media echo, no further research regarding the method has been provided. In course of his master thesis, one of our founders proposed a refined approach to increase prediction accuracy using Elastic Net Feature regression and Support Vector Machine Regression with a non-liner kernel function to benefit from the interaction terms between Facebook followings with extraordinary results.