Scraping the Web for Data
by Jennifer Guerim Kim
Definition of social network site is “a web-based services that allow individuals to construct a public or semi-public profile within a bounded system, articulate a list of other users with whom they share a connection, and view and traverse their list of connections and those made by others within the system.” (Boyd&Ellison, 2007)
Social media networking is the communication, a countless array of internet based tools and platforms to interact between people. This networking is an enormous amount of information that can be easily shared, searched, promoted, disputed, and created. These countless information, images or websites are specified and simplified by assigning or “tagging” individual sites with searchable key words.
The data collection of social media has benefits on getting new, current and representative data sources through every post and conversation. These data offers not only historical view but also an up-to-the second, streaming record of people’s beliefs, attitudes and actions which has a complete picture of their audience and can even use some of the data to predict future behavior when data is combined.
Twitter is broadcasting daily short messages about modern attention deficit world rapidly and scan-friendly. It is about discovering interesting people online, and following their burst messages for as long as they are interesting.
Af first, I searched #sugar, #sugarsweetenedbeverage, #healtheffects and found radio contents, ‘All the dirty facts about sugar, and why we can’y get enough of it’, which discuss about the barriers of our sugar addiction and the reason why we can’t stop putting in our bodies despite its horrible effects with doctors and specialists. They introduce why it’s so bad for us at the introdcution of the radio show, they focus on what it does for our bodies now, as well as over time, how the doctors and our guest deal with their sugar cravings, how to kick the bad habits of snaking on sugar and explain addiction to sugar and how we can overcome it. From this informative radio series about sugar, I found interesting points of consumption of sugary foods, most of consumer are just not aware how much sugar include in their foods due to the hidden form of sugar and different ways of sugar labelling. I tried to search up #hiddenformsofsugar and #sugarlabelling on next step.
The I found the Henrietta Norton’s blog who interested in healthy living and diverse points of view about sugary foods and heathy living. She also mentioned a list of ‘hidden form of sugar’ such as Dextrose, Modified starch, Fructose, Corn syrup, Sorbitol, Fruit juice concentrate etc.
Twitter Archiver is the simplest tool for saving tweets and it is easily capture all tweets that match particular search terms in a Google Spreadsheet automatically. People can use the tool to monitor tweets around any conference hashtag, learn what people are saying about brand, track popular search terms, save tweets from any geographic location and more.
I searched as #sugar or #sugaryfood or #sugarlabelling or #sugarnuturition.One of thousand tweets, I found the campaign as ‘ I quite sugar’ and the most highest ‘retweets’ number of their post was the shock photograph of Nutella’s nutrition.
Moreover, I found the campaign which is called by ‘I quite sugar’ and they encourage people to cut out the sugary food and teach how to eat real food follow their suggested meal plans by qualified nutritionists through their 8 weeks program. They share their programs to quit sugary foods and recipes for cooking fresh and healthy food and deep inside of sugar further information.
In overall, I could easily find on-going most contemporary issues and trends through scrapping web. Especially, references from Tweeter becomes to be an opportunity to expand my ideas and able to scope into issue of sugary foods follow the trends and get public mass opinions. Moreover, I think ‘hashtag’ and ‘retweet’ are medium to how popular issues bring people’s sympathy. As meaning of word ‘web’, search engine of social media is one of the most impertinent key method to get designer’s needs and aims on design research process through its connecting links and interaction between web users.
Agarwal, A. 2015, How to save tweets for any Twitter hashtag in a Google Sheet, Digital Inspiration, viewed on 20 Sep 2016, <http://www.labnol.org/internet/save-twitter-hashtag-tweets/6505/>.
All the dirty facts about sugar 2016, The Staying Young show 2.0, Stay Young Media Group, New York, 13 July.
Boyd, D. M. & Ellison, N. B. 2008, Social Network Sites: Definition, History, and Scholarship, Journal of Computer-Mediated Communication, pp.210-230, viewed on 21 Sep 2016, <http://onlinelibrary.wiley.com/store/10.1111/j.1083-6101.2007.00393.x/asset/j.1083-6101.2007.00393.x.pdf;jsessionid=B9BC8D3A937D5AAC232F9AA4E975A679.f01t01?v=1&t=itc9mrpd&s=c99f0fffba78964163c8e76bbda9668f107df743>.