We all know the Internet loves cats.
Cats that eat cheezburgers, cats that dance, cats that play piano… And cats that can tell strangers where you live.
OK, the cat’s don’t actually talk. But by using the meta-data embedded in all those 15 million cat photos and videos a project called iknowwhereyourcatlives.com has plotted the physical locations of more than a million cats.
And it’s a much scarier and more sobering project than it might first appear.
What is meta data?
Meta data is data that describes other data — in other words, it describes the contents or context of a piece of data. In this case, the meta data is embedded in digital photos and can include who owns the image, copyright and contact information, what kind of camera created the file, along with exposure information and descriptive information such as keywords about the photo.
The iknowwhereyourcatlives.com project exploits the fact that almost every smartphone and many stand-alone cameras also record the longitude and latitude where the photo was taken.
This information is often used by photo organization software. The iOS Photos program organizes photos by where they were taken by default.
But as the cat website project proves, other people can access that information as well.
Owen Mundy, an artist, programmer, and professor at Florida State University created the project and says, “This project explores two uses of the internet: the sociable and humorous appreciation of domesticated felines, and the status quo of personal data usage by startups and international megacorps who are riding the wave of decreased privacy for all.”
What cats have to do with your privacy — and your kids.
As Mundy points out, most Internet users are painfully unaware of the data we freely share with the world.
Smartphones and digital cameras automatically record location information, but we make the data public by sharing the photos — with their metadata intact — on sites like Flickr, Twitpic, Instagram and others.
And that data is freely available to any programmer with a little patience and inginuity.
The problem is that this kind of meta data isn’t limited to pictures of cats — and if we can create a website that can pinpoint where your cat lives within 7.8 meters accuracy, we can do the same for children.
Any photo of any subject — your home, your valuables, even your children — can be plotted on a map by anyone who chooses to look at the associated meta data.
People are woefully uneducated about the information they’re automatically sharing with the Internet. Facebook and Twitter automatically strip out location data when you upload a photo, while Instagram and Flickr give you the option (but if you add a photo to your photo map, obviously the data stays intact). Google+, Dropbox, and Tumblr, however keep that location data free for anyone to find.
You can manually remove the location data from your photos before sharing, but few people go to the trouble.
I think, however, they might when “iknowwhereyourkidlives.com” comes online.
DSC Resources
- Career: Training | Books | Cheat Sheet | Apprenticeship | Certification | Salary Surveys | Jobs
- Knowledge: Research | Competitions | Webinars | Our Book | Members Only | Search DSC
- Buzz: Business News | Announcements | Events | RSS Feeds
- Misc: Top Links | Code Snippets | External Resources | Best Blogs | Subscribe | For Bloggers
Additional Reading
- What statisticians think about data scientists
- Data Science Compared to 16 Analytic Disciplines
- 10 types of data scientists
- 91 job interview questions for data scientists
- 50 Questions to Test True Data Science Knowledge
- 24 Uses of Statistical Modeling
- 21 data science systems used by Amazon to operate its business
- Top 20 Big Data Experts to Follow (Includes Scoring Algorithm)
- 5 Data Science Leaders Share their Predictions for 2016 and Beyond
- 50 Articles about Hadoop and Related Topics
- 10 Modern Statistical Concepts Discovered by Data Scientists
- Top data science keywords on DSC
- 4 easy steps to becoming a data scientist
- 22 tips for better data science
- How to detect spurious correlations, and how to find the real ones
- 17 short tutorials all data scientists should read (and practice)
- High versus low-level data science
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge