Task #7396 (closed)
Opened 8 years ago
Closed 6 years ago
HIC: What counts as "risky" access to datasets?
| Reported by: | szwells | Owned by: | |
|---|---|---|---|
| Priority: | minor | Milestone: | Unscheduled |
| Component: | General | Version: | n.a. |
| Keywords: | n.a. | Cc: | hic@… |
| Resources: | n.a. | Referenced By: | n.a. |
| References: | n.a. | Remaining Time: | n.a. |
| Sprint: | n.a. |
Description (last modified by szwells)
It has been suggested that queries over project datasets which return either too much or too little data can be considered risky.
- Too much data returned could indicate a user is trying to circumvent access restrictions in order to "steal" the data set -- This represents a commercial risk given the value inherent in large medical datasets
- Too little data returned could risk the identification of individuals. This is especially true where the intersection of individual queries isolates very small numbers of individuals
- Trying to link across projects shouldn't be done by researchers and should be flagged up if this happens
We are trying to identify the conditions under which particular patterns of behaviour in accessing the datasets could be recognised and flagged to the HIC data manager.
Change History (3)
comment:1 Changed 8 years ago by abell-x
comment:2 Changed 8 years ago by szwells
- Description modified (diff)
Thanks Alison, added to the list in the body of the ticket
comment:3 Changed 6 years ago by jamoore
- Resolution set to invalid
- Status changed from new to closed
Closing all specific HIC tasks.
Trying to link across projects shouldn't be done by researchers and should be flagged up if this happens