This is the second part of a blog trilogy on big data, covering privacy issues. Click here to read the first blog, in which the author discusses the business disruption and reinvention caused by big data. Click here to read the third blog.
Concerns about the privacy of personal big data and its use in surveillance by governmental agencies have been well exercised, although largely unresolved, since Edward Snowden’s dramatic revelations beginning in June 2013. By contrast, thinking and real debate about the privacy of personal data collected and used by commercial organisations have been remarkable largely by their absence.
In essence, a business exists—ideally, perhaps—to satisfy, to the best of its ability, its customers’ and society’s needs through the products and/or services it develops and offers. An implicit agreement exists between the parties that, in order to best meet those customer needs, the business must understand something about the customers themselves, their uses of and opinions about existing products, and so on. Furthermore, customers also agree to receive (hopefully relevant and timely) marketing information about the business’ offerings. Both activities necessitate that the customer (or prospect) willingly relinquishes some measure of privacy in return for better products or service. Since the 1980s, this agreement has been broadly shaped by various privacy codes based on declaration of information usage by the business and the customer’s ongoing and informed consent.
With the advent of big data and, in particular, the smartphone as harbinger of the Internet of Things, both transparency of use and ongoing, informed consent have been seriously compromised. Transparency of use is almost non-existent thanks, for example, to data brokers now gathering thousands of measurable attributes about unsuspecting consumers (people) to create mailing lists and scoring algorithms to enable targeted marketing that is often largely indistinguishable from blatant discrimination. And as anybody who has ever tried to install a smartphone app without checking all data collection boxes knows, consent has been reduced to a formality. Indeed, as marketing moves to always-on and location-aware, the idea of ongoing consent may become a distant memory.
Addressing privacy concerns requires consideration of both business and technology. On the business side, monetisation models that fund “free” services through targeted advertising are particularly prone to abuse of users’ privacy. Of course, Internet behemoths like Google and Facebook are highly dependent on and successful because of this approach. But at what cost to personal privacy, which is at the heart of democracy? At a more detailed level, the ethics of collection and use of particular types of data must be considered. What are the potential negative implications of having particular data about people? Even if you have data, or the ability to combine existing data for new insights, that doesn’t mean you should use it. Above all, you must be transparent about the planned uses of the personal data you collect, and avoid the use of data that has been gathered by dubious means.
From a technology viewpoint, it is clear that strong data security is a sine qua non for even basic protection of privacy. Observe that when different data is combined from multiple sources, the context offered by one source may expose behaviours obscured in another. At the extreme, some researchers maintain that anonymised data in one source can be relatively easily de-anonymised when combined with as few as three other data sets. IT is also responsible for ensuring that data is managed and used in accordance with widely varying local privacy laws; such considerations must be reflected right back in the logical architecture of corporate data warehouses and BI systems. Legal and financial consequences of noncompliance can be severe: for example, new EU privacy laws allow fines up to 5% of global revenue or €100m. Furthermore, best legal/ethical practice suggests that the use of personal data must be regulated according to individual privacy preferences.
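To make the de-anonymisation risk concrete, here is a minimal sketch of how combining sources can re-identify people. It is an illustration only: the field names (`postcode`, `birth_year`, `gender`), the function `link_records` and all the data are hypothetical and invented for this example, and real linkage attacks are usually fuzzier than an exact match.

```python
from collections import defaultdict

# Hypothetical quasi-identifiers: harmless alone, identifying in combination.
QUASI_IDENTIFIERS = ("postcode", "birth_year", "gender")


def link_records(anonymised, identified, keys=QUASI_IDENTIFIERS):
    """Map each anonymised record id to a name when its combination of
    quasi-identifiers matches exactly one person in the identified source."""
    # Index the named source by its quasi-identifier combination.
    index = defaultdict(list)
    for person in identified:
        index[tuple(person[k] for k in keys)].append(person["name"])

    matches = {}
    for record in anonymised:
        candidates = index.get(tuple(record[k] for k in keys), [])
        if len(candidates) == 1:  # a unique match means re-identification
            matches[record["record_id"]] = candidates[0]
    return matches


# Invented "anonymised" purchase data: no names, but quasi-identifiers remain.
purchases = [
    {"record_id": "r1", "postcode": "1011", "birth_year": 1980, "gender": "F"},
    {"record_id": "r2", "postcode": "1012", "birth_year": 1975, "gender": "M"},
]

# Invented second source that carries names alongside the same attributes.
public_register = [
    {"name": "Alice Example", "postcode": "1011", "birth_year": 1980, "gender": "F"},
    {"name": "Bob Example", "postcode": "1012", "birth_year": 1975, "gender": "M"},
    {"name": "Carl Example", "postcode": "1012", "birth_year": 1975, "gender": "M"},
]

reidentified = link_records(purchases, public_register)
print(reidentified)
```

In this made-up data, record `r1` is tied to a single person, while `r2` stays ambiguous because two people share the same attributes. Each additional data set tends to add attributes that shrink such groups, which is why combining sources erodes anonymity.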
Privacy is a topic that individual businesses and their IT staff can act on directly. In the final part of this series, I’ll focus on an issue with a broader scope of impact and resolution: the potential economic and social consequences of analytics and automation enabled by big data.