Blog post 6: data scraping

While exploring data online I came across a striking lack of information about online privacy and anonymity. I feel that the general public has very little understanding of, or concern for, their rights online and the dangers posed by those who misuse their data. Although I found some very valuable articles during my research, they were often scholarly articles that would be nearly impossible to find without searching directly for their specific topics.

This led me to try to understand how widespread the problem is. I looked at Twitter data using a scraper and found a very interesting trend. A lot of people with low follower counts were talking about these issues, but very few people with more than twenty-five thousand followers were talking about them. This is, however, partly a result of people retweeting the same information, but I think the observation still holds.

Below is an example of the data that I am able to gather when searching for certain keywords or hashtags. The full data also includes the tweet text and other pieces of information such as location data and tweet ID. I feel that creating a series of visualisations from this data using Processing might produce an interesting visual outcome.

9/22/2016 4:10:37 @MGCommunityFeed MG Community 494 47
9/22/2016 4:10:39 @elisabettafaval elisabetta favale 2242 844
9/22/2016 4:10:46 @TheDefpom Scott G. 23 34
9/22/2016 4:10:58 @luke_stark stark | contrast 640 1099
9/22/2016 4:11:04 @thisguy420311 ThisGuy 50 133
9/22/2016 4:11:08 @Feeh_Marqs Fernanda Marques 41 234
9/22/2016 4:11:11 @ArleenTennant Arleen Tennant 10 37
9/22/2016 4:11:27 @rojasjenrenee Jennifer Rojas 247 628
9/22/2016 4:12:04 @nuraminnn_ amin 770 553
9/22/2016 4:12:05 @ManufacturingOD Vender Laster 147 18
9/22/2016 4:12:05 @yashalevine Yasha Levine 5259 292
9/22/2016 4:12:09 @USDroneWarDept Mama I’m Comin Drone 2265 557
9/22/2016 4:12:15 @Bobo_PK Bobo_PK 236 148
9/22/2016 4:12:15 @mmmonk Marek Lukaszuk 96 81
9/22/2016 4:12:20 @geejayeff geejayeff 965 832
9/22/2016 4:12:25 @OrganicAnt Lucid Tree 1097 1310
9/22/2016 4:12:28 @svaroschi Antonella Napolitano 7137 4958
9/22/2016 4:12:43 @news___follower News Follower 1444 961
9/22/2016 4:12:51 @ovolovely Abe O C Kowo 611 1773
9/22/2016 4:12:52 @theAfroLegalise Nnenna 5548 915
9/22/2016 4:12:54 @kaitlynschwers Kaitlyn Schwers 540 565
9/22/2016 4:12:55 @yaboybrendan Bren 874 977
9/22/2016 4:12:56 @EurecatSecurity Eurecat IT Security 15 5
9/22/2016 4:12:56 @KKanagCDM K.Kanagasubramanian 177 599
9/22/2016 4:13:01 @theAfroLegalise Nnenna 5548 915
9/22/2016 4:13:03 @Reddit_Privacy Reddit Privacy 338 76
9/22/2016 4:13:12 @jo3_f heyNSA Get A Warrant 7223 5231
9/22/2016 4:13:12 @rvivara Rodrigo Vivar 221 430
9/22/2016 4:13:14 @savetimeandmoey Save Time and Money 1878 332
9/22/2016 4:13:14 @savetimeandmoey Save Time and Money 1878 332
9/22/2016 4:13:14 @tturricane Alex Bruns 12 32
9/22/2016 4:13:17 @existais Hywel Arnold 726 1996
9/22/2016 4:13:22 @lilyaz_ princess pinky 1644 1127
9/22/2016 4:13:23 @OneNineEightOne OneNineEightOne 35 69
9/22/2016 4:13:23 @nanduhari Nandakishore H 2184 276
9/22/2016 4:13:30 @RobMcDoogie Rob McDougall 958 1098
9/22/2016 4:13:32 @em2wice Snake Plisskin 2951 3380
9/22/2016 4:13:40 @tonydamiani Tony Damiani 235 1670
9/22/2016 4:13:42 @shannonthebull Shannon Sewell 188 678
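
As a rough illustration of how rows like the ones above could be checked against the twenty-five thousand follower mark, here is a minimal Python sketch. It rests on assumptions: the row layout is taken to be date, time, handle, display name and then two numbers, and I am treating the first number as the follower count, which the scraper output does not actually label. The names parse_row and split_by_followers are made up for this example and are not part of the scraper. Repeated handles are counted once so that retweets from the same account do not inflate the totals.

```python
import re
from collections import Counter

# Assumed row layout: date, time, @handle, display name, then two integers.
# Which integer is the follower count is an assumption; here it is the first.
ROW = re.compile(r"^(\S+) (\S+) (@\S+) (.*) (\d+) (\d+)$")


def parse_row(line):
    """Split one scraped row into a dict, or return None if it does not match."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    date, time, handle, name, first, second = match.groups()
    return {
        "date": date,
        "time": time,
        "handle": handle,
        "name": name,
        "followers": int(first),   # assumed meaning of the first number
        "following": int(second),  # assumed meaning of the second number
    }


def split_by_followers(rows, threshold=25_000):
    """Count unique accounts at or below, and above, the follower threshold."""
    seen = {}
    for row in rows:
        seen[row["handle"]] = row["followers"]  # keep one entry per account
    counts = Counter(
        "above" if n > threshold else "at_or_below" for n in seen.values()
    )
    return counts["at_or_below"], counts["above"]


# A few rows copied from the sample above.
sample = [
    "9/22/2016 4:12:05 @yashalevine Yasha Levine 5259 292",
    "9/22/2016 4:12:52 @theAfroLegalise Nnenna 5548 915",
    "9/22/2016 4:13:01 @theAfroLegalise Nnenna 5548 915",
    "9/22/2016 4:13:12 @jo3_f heyNSA Get A Warrant 7223 5231",
]
rows = [r for r in (parse_row(line) for line in sample) if r is not None]
low, high = split_by_followers(rows)
print(low, high)  # 3 unique accounts at or below the threshold, 0 above
```

The same per-account counts could then be fed into Processing as the starting point for a visualisation.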

Jack Sinclair