Devoxx UK 2019
from Wednesday 8 May to Friday 10 May 2019.
Engineering Manager at Anaplan, with over 18 years of experience in Utilities, Platforms, E-Commerce, Asset Management and Finance. Special interest in Computational Social Science, Machine Learning and Data Science.
With recent developments in Computational Social Science, we see an increase in research projects that use mainstream social media APIs to access data.
It has been shown that APIs can be biased and less representative, and can suffer from unintended use limitations. We propose an alternative to the use of APIs that is based on data scraping, and have used OXPath, part of the wider Vadalog framework for bridging Machine Learning and Reasoning.
In this session we will propose a scalable architecture for capturing data from the deep web, using an implementation as a demo. We will show its advantages over conventional API usage, and will apply Topic Modelling and Sentiment Analysis to the captured data to try to bring insight into how a particular community might be manipulated through the use of bots and other means.