Author Archives: medcon

Biobank’s (April) Breach in Context

You may have seen, well, all over the news, that Biobank has done the emperor’s new clothes thing after losing data nearly 200 times that were counted (plus more that were not)…

Biobank have lost control of the data

These failures show that Biobank volunteers’ data is ‘out in the wild’ as researchers implied last year but Biobank did nothing about it.

Biobank itself doesn’t know who has got it and is being forced to resort to filing “take-down notices” on the various copies of (various amounts of) Biobank volunteers’ (medical and DNA/genomic) data that are popping up all over the place. Biobank aren’t very good at it.

Alibaba was just the latest incident, which Rory once again played down – both to the Biobank volunteers themselves, and to the British public. Biobank volunteers’ data should never have left Biobank’s (so-called) ‘Secure Data Environment’. Full stop. UK Biobank offered a download option from an environment they told their users was secure; this was after they told people they had ceased downloads. We’ve summarised the ongoing messes in various documents over years. 

“De-identified” health data is eminently re-identifiable, as The Guardian proved a few weeks ago. It is not ‘anonymous’ data – it is sensitive personal data.

Did Biobank volunteers sign up to having their data sold to eugenicists, insurers, shell companies registered in the office that (also) serve as fronts for QAnon conspiracies, TikTok’s holding company in the Cayman Islands, or sending their DNA and medical history to an undergraduate course in China

How did the people on that undergraduate course get approval for access? Did Biobank simply give access to anyone who the university said was on the course? Does it have any way to know any of that? UK Biobank has defended itself by saying Yale had a previous failure that it was sanctioned for. If you look at the Biobank project list there is no obvious project which has been closed for breaking the rules, but there are many ongoing projects as if there was no problem at all. Any sanction must have been so minor there’s no public disclosure of any of it, but Rory decided to pick on them to save his job (or NHS England can publicly confirm contemporaneous reports of breaches).

Is this behaviour what the public and patients should expect from the new Health Data Research Service?

HDRUK and Biobank share a culture

This systemic, cultural problem does not only apply to Biobank… 

In their email to their participants, Biobank direct readers to this page which says protections are “within the ‘Five safes’ framework.” We know HDRUK and UK Biobank share a culture, which is to undermine the framework by reckless and unsafe changes.

Polly Toynbee might be happy to have her medical history and genome on the internet – that is her choice to give it to Biobank and her decision is informed by her close relationship with UK Biobank leadership who insist they never do anything wrong – but would the 499,999+ others make the same decision about their data and tissue and blood if Biobank had told them up front that “research” might include sending their data to all of these outfits? That Biobank knew their system allowed downloads, but didn’t want to talk about it because staff hoped no researcher would notice? Did they tell Polly that?

Biobank tells GPs that “Careful checks are in place to ensure confidentiality and data security” which clearly isn’t true. The same page has a GP quoted by Biobank saying: “With its internationally respected managed access model and robust data protection, UK Biobank ensures this enhanced resource will drive innovation responsibly”. It wasn’t true when Biobank published it in February, and isn’t true now.

UK Biobank has started talking publicly about another wave to their cohort, which is an opportunity to give all members the ability to give a fully informed consent to continue participating. (or, if they don’t respond, to have their data flows stopped? Or will people still have to beg the Biobank call centre for the secret form to fill out and return to end their participation? What will they be told about this debacle)

With Wes Streeting breaking ‘pandemic-only’ promises about uses of patients’ GP data (having been lobbied to do so by Biobank…) and with his intention to suck up everyone’s GP data into his ‘Single Palantir Record’, this is an issue that could (and will, if the government doesn’t stop steamrollering ahead) affect everyone in England.

Biobank insists that this time their catastrophes are different. We know how that always goes… 

The latest Biobank Screwup (April 2026)

Unsurprisingly, little has changed since the data was last lost in March and is reidentifiable.

Today there was a ministerial statement in the House of Commons about a new breach (at noon exactly)

Biobank claim that the data is “deidentified” – all that means if that you can’t find someone you don’t know, if you know anything about someone you can read off the rest of their health record.

A series of terrible decisions by Biobank’s outgoing leadership have all come together at once.

That UK Biobank have referred themselves to the Information Commissioner shows they know this was personal data. it was personal data.

Biobank told their funders, their members and others that they operated a secure data environment, while allowing their users to download whatever data they wanted.

Will the “Biobank Direction” be rescinded?

They lost a lot of data

“In its statement, Biobank said the type of data offered for sale included:

• Population characteristics such as gender, age, month and year of birth;

• Assessment centre data including attendance date, socioeconomic status, lifestyle habits, mental health, self-reported medical history, cognitive function and physical measures;

• Measures from biological samples including haematology, biochemistry, metabolomic and proteomic;

• Online questionnaire data on sleep, diet, work environment and mental health;

• Health outcomes data on ICD-coded diseases, cancer diagnosis date etc.”

(source)

Why the Biobank breaches matter to you

[if you are looking for opt out instructions, they’re on our how to opt out page]

The “pandemic-only” promise about uses of GP data has been torn up in favour of Biobank

The Guardian then found the full patient-level NHS hospital data given to UK Biobank about its volunteers had been posted to the internet on several occasions – with signs of hundreds of instances of rule breaking and smaller breaches. UK Biobank insists that the NHS hospital records of their cohort are not personal data, despite The Guardian being able to reidentify individuals’ full hospital histories from other information in the dataset.

Why does this matter if you’re not one of the half million people in Biobank? It matters to you because the data leaked by UK Biobank is the same linked, patient-level NHS hospital data that is routinely sold (in full or in part) to hundreds of other organisations. And, even if you have opted out, your data is usually included.

Continue reading

Dog Cancer – is cancer treatment for your dog becoming better than treatment for your grandma, or your kids?

In 2024, we looked ahead to treatments at the end of this Parliament. In 2025, we said: “Mr Streeting knows that by the end of this Parliament in 2029/2030, it’ll be increasingly normal for you to be able to take your cat to the vet to have your cat’s cancer effectively cured.” 

As things turn out, it’s a dog that got famous for a custom mRNA vaccine to treat some cancers

Continue reading

The latest (March 2026) Biobank mess (and consequences for everyone else)

Even before these new revelations, UK Biobank had a very long list of unanswered questions (that PDF was published earlier this week and now needs extending). At the same time, Mr Streeting has decided to give Biobank data from GP records that was collected under a promise it would be used only for the pandemic.

What did the Minister know when he signed the Biobank direction? What did those who publicly supported the Direction know? Did Biobank tell them everything?

Why this matters even if you’re not in Biobank:

The Biobank direction means “pandemic only” dataset can now be reused however Mr Mandelson’s political protégé decides – GPs have been given no choice because NHS England already has the data and uses it however they are told. This action already destroys trust for the next pandemic, and undermines promises being constructed for Mr Streeting’s Single Patient Record plans where he’ll make political promises around becoming data controller for your medical notes. Apparently this is the acceptable approach and standards for where your data will go in the National Data Library.

Biobank data is still was published on the internet 

The Guardian has reported that the NHS hospital data of UK Biobank participants was repeatedly published by Biobank users, and some of it is still publicly available months after Biobank was first told that Biobank patient level data was published online. This notification was before the Direction was signed which will allow “pandemic only” GP data to flow to Biobank to be used like the rest of the Biobank data.

The statement on the Biobank website completely omits that this happened and this remains the case.

Biobank admit they don’t know who their users are

Biobank have sent many legal notices to have material taken down from the internet.

UK Biobank admits that, in every case where they send a legal notice, that is because Biobank’s attempts to identify and contact the researcher have failed. Either Biobank don’t know who the researcher is, or the researcher doesn’t care enough to reply to the Biobank email. 

It is clear that Biobank does not know who their active researchers are, because if Biobank did know who the users were, Biobank would not have to resort to takedown requests for accounts they can not identify.

In any event, Biobank gave them (or someone) access to that data in the first place – the application form is short and woefully insufficient, but it does have a space for an email address. Emails from Biobank that researchers ignore alongside ignoring the Biobank rules that Biobank say protect the NHS data they share.

Since Biobank resorted to these legal means, did Biobank notify NHS England they were doing this over NHS sourced data? 

That’s before we consider approved data use in Chinese undergraduate teaching – the lecturer is granted access, but the students get it too and Biobank has no way to know who they are.

Biobank blame their victims for Biobank’s failings
UK Biobank simply claims that no Biobank member has been harmed, and if they have, then it’s their own fault.

If you’re in Biobank, and if anyone knows anything about your medical history, they can potentially read it all. Apparently the bland text on page 23 of this newsletter was Biobank telling you about the risks you had chosen to take, and Biobank would allow researchers to take.

Given the nature of researcher conduct, it is not possible to guarantee that there are no further examples.

NHS England did a “consent audit” of Biobank, which Biobank says they passed. Is this victim blaming what NHS England’s audit found and approved? 

To quote Biobank’s newsletter “In everything we do, we ask, what would participants expect from us?” so are the Biobank statements what one would reasonably expect?

Biobank’s [ public statements ] are incompatible with their [ redacted ] 

[redacted until Biobank fix it or decide they’re willing to take that particular risk with their cohort]

The Guardian work shows how easily NHS patient data is re-identifiable

The Guardian’s efforts confirm that if you know one health event for a person, you can read off all the others through the linking pseudonym, the EID that Biobank’s response argues is so immaterial that it can be published repeatedly on the internet without consequence. 

The Biobank response also argues that if data they have lost control of leaks (as it has), then that’s that if anyone knows anything about your health, and uses their lost data to find out more, then that’s your fault. 

The Department of Health in England makes the same self-serving argument – they take risks with your data and will blame you when they go wrong. Everyone treated in an NHS hospital is in the hospital datasets that NHS England sells, usually without respecting the National Data Opt Out. 

UK Biobank’s sole remaining defence is that it’s difficult for someone you’ve never met and who knows nothing about you to reidentify you – which doesn’t address the fact that you have met many people who know something about you and your health and can now potentially read everything; or the Department of Health in England can stop making stupid mistakes.

None of this is new, the flaws and risks were discussed at length in Chapter 4 of the 2022 Goldacre Review

For Biobank participants who now wish to withdraw

We have heard that participants have withdrawn from Biobank because of their failings over recent years. Biobank claims privately no one has told them they’ve withdrawn for this reason, but then, participants don’t have to give Biobank a reason for withdrawing.

If you’re in Biobank and wish to withdraw, they make you email them for the form. You are required to know your Participant ID, which Biobank probably told you 20 years ago, you can find on some communications from them, or simply download it from the internet with most of your hospital record if you know where to look…

You can withdraw from Biobank, you won’t be allowed to withdraw from the National Data Library.

Biobank’s reckless disregard for personal data has infected the “National Data Library”

The HDR/Sudlow Review which argues that all public sector data should be linked (one topic in the ID cards consultation) and used like Biobank. At the Review launch, the former Chief Scientist of Biobank said Biobank has “one of the best systems” for data access, and Biobank data should be “used as widely as possible”, and has now been rewarded with a seat on the National Data Library advisory board. 

Biobank’s actions exemplify Mr Mandelson culture being applied to NHS data (increasingly so via the Biobank direction), and it will cover everyone everywhere in the UK via the National Data Library.

Unless DSIT agrees that the UK Biobank approach to those in their dataset as covered above will be that of the National Data Library, DSIT should remove Prof Sudlow from the advisory board. Biobank’s public response is the responsibility of the current Biobank senior leadership (most of whom should also resign in disgrace, but won’t as they blame the victims rather than accepting responsibility for their decisions; and wisely no one appointed them to an NDL seat). Responsibility is known and admitted for how Biobank ended up in the mess they have put their cohort in, the only question is whether there will be any consequences for that.

==

In addition to our annual-ish newsletter, you can also join our free substack to get emailed whenever we post some news or commentary.

The Covid Inquiry Module Reports

There has been much ‘chirping’ about the Covid Inquiry: it’s too wide, it’s too narrow; it’s not looking at the right things; it’s not looking in enough detail at the particular bit of the elephant someone spent their time holding; it’s looking in too much detail at another part or parts of the elephant; etc…

The Inquiry contains multitudes (good work on complex topics inevitably must) and the Inquiry’s modular nature lets everyone complain – much as historians can write very different books on events that were vaguely recorded, and they do. 

And those very same chirping interests tend to dismiss others as “Non-Playing Characters” when they want to remove agency from people and groups who will and should be able to make decisions they disagree with. Today’s decision makers are strong and everyone else must accept that, as if today’s decision makers are all that matter.

Data is a common theme in the Inquiry


Data keeps coming up. Everyone involved wants more data at all times to do more things.

Assumptions being made about data by the Department of Health in England (as it is now becoming) are that they want to be able to do anything they choose, and that they should be able to do whatever they want at all times.

The underlying model of DH/E is that Palantir systems will have all the data, and that DH/E controls the configuration. And if DH/E flips the data controller switch again, they’ll be able to do anything they want.

One advantage of Palantir’s product being based upon Apache Spark software is that its capabilities are public; Palantir just does ads on anything they claim they alone can do. Palantir is not magic.

In effect, DH/E’s position is that Mr Streeting wants to be the data controller for your medical notes, and he alone will decide who can use them. (Until he is replaced, of course – then his successor decides…)

These are self-prioritising claims made without external input. Alternatives exist, but Mr Streeting doesn’t care because he knows what he wants and is going to impose it upon you.

If a national analysis were needed, it could be designed nationally, run per “instance”, and the answer could be collated nationally without the “National data integration tenant” (NDIT) having the data. 


DH/E argues opt outs are an impediment because DH/E wants to copy all of the data for its own purposes, and to avoid accountability to patients (or anyone else) because a civil servant might be held responsible for something. 

Patient wishes get ignored when secrecy makes it it easier to ignore them

GP data was collected from practices under the promise of being “covid only”, a promise torn up by the current Secretary of State who doesn’t keep promises he didn’t make. It’s entirely unclear what he expects his successor to do.

It is the Department of Health in England’s view that expecting to have choices over your healthcare is excessive, and data about that care should be unrestricted within the Department – you are to be expected to accept how politicians choose to use and read your medical notes if you accept any form of care at all. If you seek NHS care, what the NHS does with that care is what’s in the interests of the Department, the personal fiefdom of a politician, and not something about which you should have information. That was also the approach to pandemic procurement, a module of the covid inquiry to come out in future.

The Covid Inquiry shows decisions have consequences, and DH/E staff wish to avoid all of those consequences (especially in the “Privacy, Transparency and Trust” team, which refuses to accept privacy, undermines claims of transparency, and undermines Trust. The disingenuous existence of that team means that other parts of DH/E can entirely ignore those principles, claiming to defend them while undermining them directly. 

When it came to the Federated Data Platform, NHS England decided in secret they didn’t have to do a full impact assessment on their “Privacy Enhancing Technologies” because the name said they were privacy enhancing. That’s the sort of decision that causes inquiries, not respects their findings

The hits keep coming, and at some point, like the decision in the Biobank direction, the bill will come due.

Being within a system distorts your perspective on success

It is blatantly obvious that the current dangerous drop in vaccination rates will not be reversed by allowing home visits for vaccination; but it’s something the system can do, so it does it and exaggerates their claims.

It is equally true, as the covid inquiry reported, that the actions of late March were significant even if the same individuals made catastrophically flawed judgments beforehand and afterwards.

The system is doing what the system does. The system has recognised that what it does is entirely out of step with public expectations (and arguably the law), but the system’s response is to want to lower public expectations towards what is easiest for the PTT team. Good luck with that. 

Some of the criticisms of the Inquiry by former officials is that the Inquiry, like some officials think of the public, should do whatever is best for the officials (thinking that also pervades the think tank glossies about how ID cards will be wonderful for think tanks).

How many people like you should the state allow to suffer and die? Why should that number be any different for people in similar circumstances but not like you?

The new structures of DH/E entirely fail to understand that there is a difference between doing the best you can, and doing what is necessary. There’s a difference between saving as many as you can, and inconveniencing as few as possible (and you as little as possible)

The cultural silo that “we alone must do it all, alone” epitomised by the current US administration’s approach to, well, everything,  is also prevalent across DH/E.

Of course, those with the resources to have their own version of covid events will be able to commission their own books when the documents reach The National Archives in due course.  Your health records are widely available to anyone who applies to use them for now because the system doesn’t want to inconvenience valued colleagues by accepting patients wishes.

You only need to look at the Biobank mess to see how this goes wrong.

====


Join our mailing list for occasional substantive updates. In addition, you can join our substack to receive updates when we post updates to our website (subscribers and donations are very welcome – medConfidential currently has no substantive funding for 2026.)

Government tells NHS that GPs should misinform to patients

[14 March: this piece was written and published before The Guardian disclosed that UK Biobank (who will receive the GP data discussed in this piece) had repeatedly leaked NHS data onto the unrestricted internet. We’ll update this shortly, addressing the UK Biobank reassurance which should do anything but – no one knowing about anything in your medical history other than UKB is not realistic, but it is self-serving for UKB. See also A Warning for Experts by Experience]

During the pandemic, your data was collected from every GP under the promise that its use would be for “pandemic only”purposes. That promise is still on your GP’s website. Mr Streeting has decided to tear up that promise and is seeking to do so behind patient’s backs by instructing GPs not to tell patients of the change

This is going to become a very large mess.

GPs were told in 2020 to put on their websites a promise that the data would only be used for the pandemic. That wording is still there. Mr Streeting has Directed NHS England to reuse the data however he wishes, but GPs have been told that “no action” is required and is therefore not monitoring acceptance – despite the fact that taking no action means the practice website will still be telling their patients that the “pandemic only” constraints are still in effect when they have been torn up and Mr Streeting does what he wants with their data. This is the first time, but there are repeats on the horizon.

And this matters even more because Mr Streeting ‘crossed a political rubicon’ when he signed the Direction saying that the NHS can lie to patients about how data is used. Like a Silicon Valley techbro, his actions show he can make a promise today and break it with a swish of his pen tomorrow. (Of course this also means he could choose to reverse this decision and stick to the “pandemic only” promise; if he doesn’t, the die is cast.)

Latest in a line of breached promises

The secretive charity HDRUK already broke the “pandemic only” rules when they trained an AI on “pandemic only” medical records for any purposes they like. When NHS England asked its independent Advisory Group on Data to assess those projects (item 5.1), the majority did not support them. 

NHS England refused to do a meaningful investigation, instead limiting itself to simply asking HDRUK whether a project was only used for pandemic purposes and believing their answer, despite the assessment of their independent advisors. HDRUK persists with the levels of transparency and accountability of the Boris Johnson administration. 

Lying to the press is not a crime”, says Baroness Mone, OBE – and the shared culture of HDRUK and Biobank has applied that same principle to patient data.

Biobank still sends patient data around the world and does not disclose which users are given exceptions to Biobank’s supposed rules. Our list of unanswered questions is long. Biobank continue to allow the company related to the eugenicists to access their servers (which are subsidised by UK tax payers), and Biobank claims “Byte Dance Ltd” are doing genomics research via their Cayman Islands holding company, despite claims the project is based in the United States from staff in China. (Byte Dance makes the TikTok app). After Biobank angrily insisted there was no problem with Biobank giving to insurers data that was donated for research, they later quietly slipped out that they had stopped; a US shell company called Flying Troika was never investigated, and Biobank approved last month a new project explicitly to give undergraduates in China real NHS/Biobank data to use in exercises.

This all matters not only because of the plans for a Single Patient Record and the Health Data Research Service, but because HDRUK and Biobank share a culture and say they designed the HDRS. This all matters because of the proposals for a Health Data Research Service, and because of the proposal for the Secretary of State to be Data Controller for the Single Patient Record.

If a patient has a National Data Opt Out covering research, none of their data should go into HDRS. The Biobank Direction threatens that. The published Direction is only for “consented cohorts”, but a future second half is under discussion covering “unconsented cohorts”, which likely be as broad as HDR’s past requests; cohorts such as “people who have hearts”, or “who are breathing”.

The extent to which HDRS delivers on the goals, delivers for patients, or does neither of those things depends on decisions not yet officially made. As DHSC takes over NHS England, and takes over the data release registers, those DH registers must include all NS data that flows out of any DH entity, otherwise they are by definition incomplete. The NHS itself has largely avoided data re-use scandals over the last decade by having complete registers of decisions made and where data flowed. They may have got in trouble about new datasets, but the decisions on old ones were clear. (until HDRUK tore up that consensus by breaking the covid-only rules – will HDRS look more like the transparent governance of NHS England or the secretive cartel of HDRUK or the unaccountable club that is UK Biobank). There are now many moving parts, and who will gain a short term advantage is unclear. 

HDRS could be good

The new Health Data Research Service could be good; it could be consensual, safe, and transparent. But there’s widespread concerns that it will not be. (We’ll have more on HDRS soon).

HDRS has the opportunity to get it right from day one, and we see no sign of DH allowing them to do that – the job ad for the Chair/CEO of HDRS said that HDRS decisions will be “directly accountable to Ministers”.

If patients who have opted out of their personal data being used in research are overridden by politicians and HDRS so they have their data used in research against their wishes, then that will likely go as well as some of the other pre-u-turn decisions of this government. 

Patients have choices

Polling says that up to 20% of people think they have opted out, but official statistics show only 5.5% actually have a National Data Opt out. If a patient is concerned about whether they have opted out, they can check with the online system, and then they need to do the two or more step process to actually opt out: online individually for your National data, and then on paper for your GP data for your whole family, and then a different paper form for National data for your kids! (The previous government designed this system to create administrative barriers to opting out; the new Government hasn’t made it worse yet at least)


Government is currently going through a ‘process’ (involving push polls about which we have had complaints from those attending the focus groups) to look at whether opt outs should be “reformed” in ways which mean an opt out after some date in the future may do less than the opt out does today. As seen with HDRUK, some in research think patients views don’t apply to them – and NHS England doesn’t want opt outs to apply to their “planning” (i.e. everything Government does in health).

It is important for patient confidence that there be “no surprises”, and that genuine transparency shows that promises are being kept or being carefully changed. Without this, how can any patient have confidence in the decisions being made as it takes control of their medical notes in the Single Patient Record?

Moving Parts: Current and Imminent Government plans for your medical records

14 March update: this was published a few days before The Guardian revealed that Biobank data has been lost several times and Biobank say it’s patient’s fault if they come to harm.]

Government is gearing up for a bunch of announcements about taking your medical record and doing what they want with them – there are many moving parts.

To inform future pieces, we’ve published a long piece on many of those moving parts (not all, because we keep some surprises as surprises).

HDRUK, Biobank, and Mr Streeting argue that privacy doesn’t matter, and they and their cronies should be able to do whatever they want with data without consequence.  They use that argument to break pandemic-only promises, then it’ll be imposing Palantir, and then taking away what rights you do have. Mr Streeting is implementing what he learnt from his political mentor, Mr Mandelson.

We’ll see about that – and perhaps this government will change course when it realises it has done something unwise. 

You can read the full document here.

======


Join our mailing list for occasional substantive updates. In addition, you can join our substack to receive updates when we post updates to our website (subscribers and donations are very welcome – medConfidential currently has no substantive funding for 2026.)