Professional Documents
Culture Documents
Abstract
Data analytics, particularly the current rhetoric around Big Data, tend to be presented as new and innovative, emerging
ahistorically to revolutionize modern life. In this article, we situate one branch of Big Data analytics, spatial Big Data,
through a historical predecessor, geodemographic analysis, to help develop a critical approach to current data analytics.
Spatial Big Data promises an epistemic break in marketing, a leap from targeting geodemographic areas to targeting
individuals. Yet it inherits characteristics and problems from geodemographics, including a justification through the
market, and a process of commodification through the black-boxing of technology. As researchers develop sustained
critiques of data analytics and its effects on everyday life, we must so with a grounding in the cultural and historical
contexts from which data technologies emerged. This article and others (Barnes and Wilson, 2014) develop a historically
situated, critical approach to spatial Big Data. This history illustrates connections to the critical issues of surveillance,
redlining, and the production of consumer subjects and geographies. The shared histories and structural logics of spatial
Big Data and geodemographics create the space for a continued critique of data analyses role in society.
Keywords
Big Data, geodemographics, data science, data analytics, black box, critical data studies
Corresponding author:
Craig M Dalton, Department of Global Studies and Geography, Hofstra
University, Hempstead, NY 11549, USA.
Email: craig.m.dalton@hofstra.edu
Creative Commons CC-BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial
3.0 License (http://www.creativecommons.org/licenses/by-nc/3.0/) which permits non-commercial use, reproduction and distribution
of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.
sagepub.com/en-us/nam/open-access-at-sage).
Downloaded from by guest on September 10, 2016
Both processes are market driven and demand a correspondence between the dispositions, attitudes, and
socioeconomic characteristics of a signicant majority
of the data subjects and the classication itself in order
to generate value (Uprichard et al., 2009: 2827).
However, geodemographics uses the neighborhood or
postal code, an area, as the unit of measure, whereas
spatial Big Data promises the same outcomes at the
individual level.
Spatial Big Data commodies the individual. Ones
personal locations, dispositions, attitudes, and socioeconomic characteristics are the object of analysis,
rather than geographically located populations.
Industry hype promises this shift will create the
killer application of the 21st Century: individually
targeted location-specic ubiquitous advertising
(Krumm, 2011). Such applications use location data,
combined with other information, to serve advertisements. For example, if an application on my phone
knows that Im at Lowes, it could remind me that my
mothers birthday is coming up and suggest I purchase
some tomato plants for her garden. If the application
records me moving at jogging speed over long distances, it might advertise new running shoes or a gym
with a track in the winter. Mobile devices, enabled with
location-tracking, accelerometers, pedometers, and
even heart rate monitors, transform individual people
into both sensors of the surrounding world (Goodchild
and Li, 2012) and sensors of themselves. Wolf, Kelly,
and others in the Quantied Self movement engage this
shift on a personal level (Wolf, 2011). On a wider level,
those who purchase, analyze, and leverage this data
collect an individuals information to target ads at
that individual person, not families or people residing
in the same postal code. More ows of data, not only
from ones purchase history, but also from ones locations, speed, and even heart rate, facilitate more personalized targeting.
On a fundamental level, both geodemographics and
spatial Big Data assume that social identity can be
reduced to measurable characteristics that can be algorithmically classied. Furthermore, as with social physics, this assemblage of personal data is predictive, or
can be made to be so, commodifying it as valuable in
marketing and ultimately making a sale. While geodemographics and spatial Big Data are hardly alone on
this point, this commodication proceeds in specically
geographic ways. A neighborhoods assigned geodemographic class or an individuals assembled prole can
become a self-fullling prophecy as people respond to
advertising as consumers. Feedback loops may form in
which consumers are presented with options based on
available data. Their choices are then used by marketers to target subsequent advertisements, advancing
some options and limiting the consumers perceived
unprecedented delity (Anderson, 2008). In this formulation, Google conquered the advertising world by
leveraging more and better data for prot without
concerning itself with explanation (Anderson, 2008).
Andersons end of theory argument has been criticized both by Big Data evangelists (Silver, quoted in
Marcus and Davis, 2014) and critics (Bollier and
Firestone, 2010). Though some scholars attempt to critically evaluate the epistemology of Big Data (Boyd and
Crawford, 2012; Burns and Thatcher, 2014; Dalton and
Thatcher, 2014; Miller and Goodchild, 2014), such
debates also contribute to the myth of its newness, serving a market function by building buzz around Big
Data as a business necessity, a must-have for competitive targeted advertising. The underlying market orientation, with its goals of predictive, actionable data over
austerely correct information, remains in both Big Data
and spatial Big Data.
The focus on data correlation for capital gain is
apparent in Facebooks recent deletion of fake
accounts and imposition of more stringent identication requirements for authentic personal accounts.
For Facebook, value is generated through a like economy whereby a single users social actions4 are
instantly turned into valuable consumer data and
enter multiple cycles of multiplication and exchange
(Gertlitz and Helmond, 2013). Facebook feared fake
likes from automated accounts, because they meant
that like data no longer correlated to reality, a gap
that could potentially cost the company revenue if
advertisers lost faith in the marketing value of likes
(Fung, 2014). Therefore, Facebook radically altered
who could participate and thus what data is created
under the impetus of its explicitly prot driven
orientation.
A similar impetus drove Foursquares recent reinvention as a geographically indexed place recommendation
service. With spatial Big Data, the use of mobile smartphones with GPS technology transformed physical
location and experience into a digital commodity that
may be bid for, bought, and sold (Thatcher, 2013).
Leveraging the data of their 45 million users,
Foursquare can build a cache of personal information
[the company] then uses it to provide [users] with suggestions on where to go in the future (Larson, 2014).
By knowing where users are and what they like,
Foursquare can sell ads targeted at the individual, as
represented by his/her data. Achieving the killer
app, Foursquare oers the possibility that when a
shopper goes to Best Buy, Samsung will be able to
send them an ad for its TVs (Frier, 2013). Building
the promise of a promise of prots, Foursquare
received a $41 million dollar loan (Cooper, 2007: 142)
and $50 million in additional investment (Delo, 2014)
to achieve this goal. As Foursquare moved more
Geographic scale
Individual
Epistemology
Yes
Yes
Practical approaches
The proponents of geodemographic analysis continue
to address its epistemological uncertainties through the
narrowing of concern to methodological issues. They
acknowledge the problem of the datas scale, discussed
above, noting concerns with the ecological fallacies of
data and the modiable aerial unit problem (Debenham
et al., 2003; Duckham et al., 2001). Nevertheless, such
foundational questions are set aside as geodemographic
systems relation to the real world and hence utility
remain judged by their competitiveness on the market
(Burrows and Gane, 2006; Harris et al., 2005). This can
have unforeseen consequences when, due to funding
cutbacks and neoliberalization of geographic government services (Burns, forthcoming; Leszczynski,
2012), commercial geodemographic systems, with
Theoretical approaches
Other scholars oer more theoretical critiques of geodemographic analysis that retain relevance for critically
understanding spatial Big Data. Goss (1995a) outlines
how geodemographic systems are a strategy for producing a control society and the social subjects within
that context. Surveillance technologies and practices
rst collect data about individuals that is classied
within geodemographic systems. The resulting socially
classied knowledge denes social and geographical
subject positions through advertising, reproducing
those social, geographical categories in peoples
material consumptive practices. In this way, a geodemographic system produc[es] the conditions of its own
reproduction (Goss, 1995a, 1995b). At stake in such
systems is not merely the invasion of privacy, but also
the very autonomy to choose individual paths and life
outcomes. For example, if based on census data, a particular neighborhood ts into the Hard-Pressed
Families geodemographic class, that neighborhood
may become the target of a direct mail campaign for
sub-prime mortgage renancing. As residents of that
neighborhood renance their properties, the neighborhood itself is materially reproduced by and in the terms
of that geodemographic class, making it ever more ripe
for subsequent sub-prime marketing.
Though powerful, Goss critique is contingent on
geodemographic targeting and the resulting advertising
actually functioning as well as proponents claim.
However, geodemographics need not be wholly eective to raise social and ethical questions.
Lyon (2002) argues that geodemographics relies on a
phenetic x of classication with consequences for
social opportunity. Geodemographic systems capture
personal data triggered by human bodies and . . . use
these abstractions to place people in new social classes
of income, attributes, preferences, or oences, in order
to inuence, manage or control them (Lyon, 2002). In
practice, some classes of people will be more promising
for particular ends than others and will therefore garner
more or better advertised opportunities, while others
are ignored or oered inferior options. Parker et al.
(2007: 917) highlight the recursive nature of this process: Class places people into dierent types of
places, in turn producing the spaces of particular
classes. The application and impact of geodemographic classication recursively reinforces this spatialization of class (2007: 917) by quantifying and
codifying it through the presentation of advertising
and services meant for that recursively constituted
class. Within trends towards more splintered urban realities (Graham and Marvin, 2001), Phillips and Curry
(2003) suggest a darker consequence: codifying classes
through geodemographic divisions constitutes a form
of redlining based on geodemographic classes and geographic units, and therein a loss of the public domain.
Spatial Big Datas production of social subjects presents similar issues, but at a more personalized scale.
For example, a Big Data analysis of data from a smartphone locative application, such as Waze or Gas Buddy,
would show that a particular motorist regularly drives a
particular stretch of road. Based on that knowledge,
gas stations along that route could oer advertisements
or even discounts on food with the purchase of gas,
producing or reproducing that motorist as a consumer
subject at that location. If the motorist in question
stops and takes advantage of a deal, that additional
10
11
exceptional newness may distract from them, but it cannot resolve them. As scholars and practitioners using
spatial Big Data, we are in part responsible for the
knowledge and social relations that these issues
create and re-create. In this context, it is crucial to
develop critical (and self-critical) perspectives and
approaches to spatial Big Data and subsequent technologies (Dalton and Thatcher, 2014). Just as GIS practitioners cannot stand aside from the processes and
consequences of the technology (Crampton, 2010),
we scholars and practitioners of spatial Big Data must
evaluate our situatedness and positionality (Haraway,
1991; Harding, 2004) and the forms of knowledge and
social relations we help produce. Such an approach
entails more than rening practices. It requires reexively analyzing spatial Big Data and its own analytics
in context, not as indicators of some event, but as phenomena and epiphenomena in and of themselves
(Wilson, 2014b). Establishing the historical, social context of a technology is a key step in demystifying and
denaturalizing it. The historical pre-conditions for spatial Big Data set by geodemographics and earlier developments (Barnes and Wilson, 2014) allow us to learn
from those earlier processes to better critically evaluate
current technologies and knowledge. Big Data has historical antecedents and so too does critical thought on
technology (Feenberg, 1999; Marcuse, 1982; OSullivan,
2006) and its spatial aspects (OSullivan, 2006; Pickles,
1995; Schuurman, 2000). These approaches can and
should inform reexive, critical engagements of spatial
Big Data.
Notes
1. Data about me and the analytics applied to it, not the
location-based hardware and software that collected it.
2. Geographic information, software, and hardware that
operate over the World Wide Web.
3. In June 2013, Google acquired Waze, a social media trafficnavigation company, for nearly one billion dollars after a
bidding war with Facebook and Apple (Graziano, 2013).
Weeks later, Apple acquired both Locationary Inc., which
maps business locations, and HopStop, an urban routefinding platform (Burrows and Frier, 2013; Paczkowski
and Gannes, 2013).
4. Such as liking a link or picture.
5. https://datasift.com/source/6/twitter
6. http://www.internetlivestats.com/twitter-statistics/
7. Open source geodemographic classifications, which utilize
public data and open algorithms, address blackboxing
issues. However, it tends to be exercised by the public
sector, with different imperatives and mandates for transparency than commercial geodemographic analyses
(Singleton and Longley, 2009).
8. An IP address is a series of numbers (such as 192.168.0.1)
used to identify every device participating in a computer network using the defined Internet Protocol for communication.
References
Anderson C (2008) The end of theory: The data deluge makes
the scientific method obsolete. Wired, 23 June. Available
at: http://www.wired.com/science/discoveries/magazine/
16-07/pb_theory (accessed 24 October 2014).
Andrejevic M (2014) The Big Data divide. International
Journal of Communication 8: 16731689.
Ansolabehere S and Hersh E (2012) Validation: What big
data reveal about survey misreporting and the real electorate. Political Analysis 20(4): 437459.
Arthur C (2014) Smartphone explosion in 2014 will
see ownership in India pass US. The Guardian: Global
Development. Available at: http://www.theguardian.com/
technology/2014/jan/13/smartphone-explosion-2014india-us-china-firefoxos-android (accessed 14 June 2014).
Baginski J, Sui E and Malecki E (2014) Exploring the intraurban digital divide using restaurant reviews: A case study in
Franklin County, Ohio. The Professional Geographer
66(3): 443455.
Barnes T and Wilson MW (2014) Big data, social physics, and
spatial analysis: The early years. Big Data & Society 1(1):
DOI: 1177/2053951714535365.
Battelle J (2005) The Search: How Google and Its Rivals
Rewrote the Rules of Business and Transformed Our
Culture. New York, NY: Penguin.
Batey P and Brown P (1995) From human ecology to customer targeting: The evolution of geodemographics.
In: Longley P and Clarke G (eds) GIS for Business and
Service Planning. London: Longman, pp. 77103.
Boellstorff T (2013) Making big data, in theory. First Monday
18(10). Available at: http://firstmonday.org/ojs/index.php/
fm/article/view/4869/3750 (accessed 24 October 2014).
Bollier D and Firestone C (2010) The Promise and Peril of Big
Data. Washington, DC: Aspen Institute, Communications
and Society Program.
Booth C (1902) Life and Labour of the People of London.
London: Macmillan.
Boyd D and Crawford K (2012) Critical questions for big
data. Information, Communication & Society 15(5):
662679.
Bughin J, Chul M and Nayika J (2010) Clouds, Big Data, and
smart assets: Ten tech-enabled business trends to watch.
New York, NY: McKinsey & Company. Available at:
http://www.mckinsey.com/insights/high_tech_telecoms_internet/clouds_big_data_and_smart_assets_ten_techenabled_business_trends_to_watch (accessed 4 November
2014).
12
Burgess J and Bruns A (2012) Twitter archives and the challenges of Big Social Data for media and communication
research. M/C Journal 15(5). Available at: http://journal.
media-culture.org.au/index.php/mcjournal/article/
viewArticle/561 (accessed 18 August 2015).
Burns R (2015, forthcoming) The Inequality of the Geospatial
Web: How Emerging Geographic Technologies Influence
Humanitarian Practices. PhD Thesis, University of
Washington, WA.
Burns R and Thatcher J (2014) Whats so big about big data?
Finding the spaces and perils of big data. GeoJournal,
Guest Editorial. Epub ahead of print October 2014.
DOI: 10.1007/s10708-014-9600-8.
Burrows P and Frier S (2013) Apple said to buy HopStop,
pushing deeper into maps. In: Bloomberg. Available at:
http://www.bloomberg.com/news/2013-07-19/apple-saidto-buy-hopstop-pushing-deeper-into-maps.html (accessed
24 October 2014).
Burrows R and Gane N (2006) Geodemographics, software
and class. Sociology 40(5): 793812.
Carr A (2013) Why Yahoo and Apple want Foursquares
data. In: Fast Company. Available at: http://www.fast
company.com/3016250/why-yahoo-and-apple-want-four
squares-data (accessed 4 November 2014).
Castree N (2003) Commodifying what nature? Progress in
Human Geography 27(3): 273297.
Cooper M (2007) Life as Surplus: Biotechnology and
Capitalism in the Neoliberal Era. Seattle: University of
Washington Press.
Cox M and Ellsworth D (1997) Application-controlled
demand paging for out-of-core visualization. In:
Proceedings of the 8th conference on visualization 97.
Phoenix, AZ, 1824 October 1997, pp. 111. USA: IEEE
Computer Society Press.
Crampton J (2010) Mapping: A Critical Introduction to
Cartography and GIS. Malden, MA: Wiley-Blackwell.
Crampton J, Graham M, Poorthuis A, et al. (2013) Beyond
the geotag: Situating Big Data and leveraging the potential of the geoweb. Cartography and Geographic
Information Science 40(2): 130139.
Crampton J, Roberts A and Poorthuis A (2014) The new
political economy of geographical intelligence. Annals of
the Association of American Geographers 104(1): 196214.
Dalton C (2013) Sovereigns, spooks, and hackers: An early
history of Google geo services and map mashups.
Cartographica 48(4): 261274.
Dalton C and Thatcher J (2014) What does a critical
data studies look like, and why do we care? Seven
points for a critical approach to big data. Society and
Space Open Site: Run by the Editors of the Journal
Environment and Planning D: Society and Space,
Available at: http://societyandspace.com/material/commentaries/craig-dalton-and-jim-thatcher-what-does-a-critical-data-studies-look-like-and-why-do-we-care-sevenpoints-for-a-critical-approach-to-big-data/ (accessed 24
October 2014).
Debenham J, Clarke G and Stillwell J (2003) Extending geodemographic classification: A new regional prototype.
Environment and Planning A 35(6): 10251050.
13
Krumm J (2011) Ubiquitous advertising: The killer application for the 21st century. Pervasive Computing, January
March 11, 6673.
Laney D (2001) 3D data management: Controlling data
volume, velocity, variety. Application Delivery Strategies
Meta Group File 949. Available at: http://blogs.gartner.
com/doug-laney/files/2012/01/ad949-3D-Data-Manage
ment-Controlling-Data-Volume-Velocity-and-Variety.pdf
(accessed 14 May 2015).
Larson S (2014) Foursquare CEO: How well tell you where
to eat and what to order. In: ReadWrite, 17 March, 14.
Available at: http://readwrite.com/2014/03/17/foursquaredennis-crowley-ceo-anticipatory-computing (accessed 24
October 2014).
Latour B (1999) Pandoras Hope: Essays on the Reality of
Science Studies. Cambridge, MA: Harvard University
Press.
Laurila J, Gatica-Perez D, Aad I, et al. (2012) The mobile big
data challenge. Nokia Research. Available at: http://
research.nokia.com/files/public/MDC2012_Overview_Lau
rilaGaticaPerezEtAl.pdf (accessed 20 October 2014).
Lazer D, Kennedy R, King G, et al. (2014) The parable of
Google Flu: Traps in big data analysis. Science 343(6176):
12031205.
Leslie D (1999) Consumer subjectivity, space, and advertising
research. Environment and Planning A 31(8): 14431458.
Leszczynski A (2012) Situating the geoweb in political economy. Progress in Human Geography 36(1): 7289.
Leszczynski A (2014) On the neo in neogeography. Annals of
the Association of American Geographers 104(1): 6079.
Leys C (2001) Market Driven Politics. London: Verso.
Lohr S (2012a) Sure, big data is great. But so is intuition. The
New York Times, 29 December, 12. Available at: http://
www.nytimes.com/2012/12/30/technology/big-data-isgreat-but-dont-forget-intuition.html (accessed 24 October
2014).
Lohr S (2012b) The age of big data. The New York Times, 11
February. Available at: http://www.nytimes.com/2012/02/
12/sunday-review/big-datas-impact-in-the-world.html?_r
1&scp1&sqBig%20Data&stcse&gwhF7CD17271A
F71E11F85924B08AF1A18A&gwtpay&assetTypeopi
nion (accessed 30 May 2015).
Long X, Jin L and Joshi J (2012) Exploring trajectorydriven local geographic topics in Foursquare. In:
Proceedings of ACM Ubicomp 12. Pittsburgh, PA,
pp. 927934.
Longley P (2005) Geographical information systems: A
renaissance of geodemographics for public service delivery. Progress in Human Geography 29(1): 5763.
Longley P (2012) Geodemographics and the practices of geographic information science. International Journal of
Geographical Information Science 26(12): 22272237.
Longley P and Harris R (1999) Towards a new digital data
infrastructure for urban analysis and modelling.
Environment and Planning B: Planning and Design 26:
855878.
Lyon D (2002) Surveillance studies: Understanding visibility,
mobility and the phonetic fix. Surveillance and Society
1(1): 17.
14
15