Post 6

By Mart Yeung

In last week's exercise I was introduced to a Twitter search tool called Twitter Archiver, which lets us create a spreadsheet that captures posts containing specific words and hashtags. This method is an entry-level use of big data for collecting and selecting information. The program works as follows: install it and link it to my Twitter account, which the program uses as a portal to scan the platform. Then type in any keywords I think can be linked to the topic, as well as hashtags; you can also enter words to be filtered out, to avoid spam and irrelevant content. The results are shown in a Google spreadsheet.
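To make the keyword, hashtag and excluded-word settings described above more concrete, here is a minimal Python sketch of that kind of filter. It is only an illustration of the idea, not how Twitter Archiver actually works internally: the function names, the sample posts and the rule "keep a post if it contains any keyword or hashtag and none of the excluded words" are all my own assumptions.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into plain words and #hashtags."""
    return re.findall(r"#?\w+", text.lower())


def matches(text, any_words, hashtags=(), exclude=()):
    """Hypothetical filter: keep a post that contains at least one keyword
    or hashtag and none of the excluded words (assumed rule, see above)."""
    tokens = set(tokenize(text))
    # Drop the post as soon as an excluded word appears in it.
    if any(word.lower() in tokens for word in exclude):
        return False
    wanted = {w.lower() for w in any_words} | {h.lower() for h in hashtags}
    return bool(tokens & wanted)


# Made-up sample posts, only for illustration.
posts = [
    "New report on online surveillance and #privacy",
    "Buy Windscrube now! Protect your privacy",
    "Lovely weather today",
]

kept = [
    p for p in posts
    if matches(p, ["surveillance", "privacy"], ["#bigdata"], ["Windscrube"])
]
```

With these made-up posts, only the first one is kept: the second contains the excluded word and the third contains none of the keywords.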
This tool is useful to me because it can be used to research up-to-date information, opinions and events associated with the issue. The quantity and trend of opinions can reflect the area affected by data retention.
In the first search I used:
online privacy data OR retention  OR surveillance OR anonymous OR online OR privacy OR
words: online privacy
hashtag: #bigdata
From the first search, I found that the data collected was not as relevant as predicted. It scanned some three hundred posts that talk about the issue, but most of them are spam advertising and reposts of articles. The keywords might not lead directly to the data I need, possibly because users didn't discuss the issue using those exact keywords. A more localized or more targeted audience group should be aimed for.
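One rough way to check how much of a result set is reposting is to count it. The sketch below assumes the post texts have been copied out of the spreadsheet into a Python list, and that a repost can be recognised by text starting with "RT @" (the usual retweet prefix); the function name and the sample texts are made up for illustration.

```python
from collections import Counter


def summarize(texts):
    """Count reposts and repeated texts in a list of post texts.

    Assumption: a repost is any text that starts with "RT @".
    """
    cleaned = [t.strip() for t in texts]
    reposts = [t for t in cleaned if t.startswith("RT @")]
    originals = [t for t in cleaned if not t.startswith("RT @")]
    # Identical texts posted more than once often point to spam or adverts.
    counts = Counter(t.lower() for t in originals)
    repeated = {text: n for text, n in counts.items() if n > 1}
    return {
        "total": len(cleaned),
        "reposts": len(reposts),
        "repeated_texts": repeated,
    }


# Made-up sample texts, only for illustration.
sample = [
    "RT @someone: Article on data retention",
    "Protect your privacy today, click here",
    "Protect your privacy today, click here",
    "I worry about online surveillance",
]
summary = summarize(sample)
```

A high share of reposts or repeated texts would support the impression that the keywords are catching shared articles and adverts rather than personal opinions.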
In the second search I used:
online privacy data OR retention OR government OR surveillance OR anonymous OR online OR privacy OR data freedom government surveillance anonymous
words: online privacy
hashtag: #bigdata #spy #privacy #anonymous
anti-word: Windscrube
This search focuses on human rights in relation to online security. I added a few words to narrow the topic, to see whether any opinions describe how online surveillance and data retention have affected people's human rights, and how values differ between regions.
Another criterion of the research is the region of the posts, as different countries will respond differently to the issue because of culture, media trends or politics. In countries that use filtering or firewalls to block access to social media like Twitter, it is hard to find the voices and opinions of the people, for example Turkey, Afghanistan, Iraq, North Korea and China. Ironically, these countries are thought to rely heavily on data retention, so the problem of online security should be strongly reflected among their users.
Some countries also filter keywords associated with censored or sensitive information, so those keywords do not show up in results from those regions.
Overall, further experiments with different search words need to be carried out, and different social media should be observed in order to compare opinions.
Not many direct messages or first-person statements were found in support of online surveillance. On the topic of human rights, there
I expected to see some personal reports of people's experiences, but ironically people didn't post many personal comments or experiences on Twitter, so searches on other platforms, such as Facebook or Tumblr, need to be carried out.