Package: sdcMicro 5.7.8

Matthias Templ

sdcMicro: Statistical Disclosure Control Methods for Anonymization of Data and Risk Estimation

Data from statistical agencies and other institutions are mostly confidential. This package, introduced in Templ, Kowarik and Meindl (2017) <doi:10.18637/jss.v067.i04>, can be used to generate anonymized (micro)data, i.e. to create public- and scientific-use files. The theoretical basis for the implemented methods can be found in Templ (2017) <doi:10.1007/978-3-319-50272-4>. Various risk estimation and anonymization methods are included. The package also includes a graphical user interface, published in Meindl and Templ (2019) <doi:10.3390/a12090191>, that gives access to various methods of this package.
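
The description above can be illustrated with a short sketch of one possible workflow: build an sdc object, suppress values to reach k-anonymity, microaggregate the continuous variables, and extract the anonymized data. It uses only functions and the testdata set listed on this page. The choice of key variables, the weight column, k = 3, and the microaggregation settings are assumptions made for illustration, not recommendations from the package authors.

```r
# Illustrative sketch only: variable roles and parameter values are assumptions.
library(sdcMicro)

# 'testdata' ships with the package (see the help manual listing).
data("testdata", package = "sdcMicro")

# Declare which columns act as categorical key variables, continuous
# key variables, and the sampling weight (assumed roles).
sdc <- createSdcObj(
  dat       = testdata,
  keyVars   = c("urbrur", "roof", "walls", "water", "electcon", "relat", "sex"),
  numVars   = c("expend", "income", "savings"),
  weightVar = "sampling_weight"
)

# Local suppression on the categorical key variables, targeting 3-anonymity.
sdc <- localSuppression(sdc, k = 3)

# Microaggregate the continuous key variables in groups of 3 records.
sdc <- microaggregation(sdc, method = "mdav", aggr = 3)

# Pull the anonymized data set back out of the sdc object.
anon <- extractManipData(sdc)
```

Printing sdc after each step shows how the risk and utility summaries change, which can help when comparing alternative parameter choices.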

Authors: Matthias Templ [aut, cre], Bernhard Meindl [aut], Alexander Kowarik [aut], Johannes Gussenbauer [aut], Organisation For Economic Co-Operation And Development [cph], Statistics Netherlands [cph], Pascal Heus [cph]

sdcMicro_5.7.8.tar.gz
sdcMicro_5.7.8.zip (r-4.5), sdcMicro_5.7.8.zip (r-4.4), sdcMicro_5.7.8.zip (r-4.3)
sdcMicro_5.7.8.tgz (r-4.4-x86_64), sdcMicro_5.7.8.tgz (r-4.4-arm64), sdcMicro_5.7.8.tgz (r-4.3-x86_64), sdcMicro_5.7.8.tgz (r-4.3-arm64)
sdcMicro_5.7.8.tar.gz (r-4.5-noble), sdcMicro_5.7.8.tar.gz (r-4.4-noble)
sdcMicro_5.7.8.tgz (r-4.4-emscripten), sdcMicro_5.7.8.tgz (r-4.3-emscripten)
sdcMicro.pdf | sdcMicro.html
sdcMicro/json (API)
NEWS

# Install 'sdcMicro' in R:
install.packages('sdcMicro', repos = c('https://sdctools.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/sdctools/sdcmicro/issues

Uses libs:
  • c++ – GNU Standard C++ Library v3

On CRAN:

10.09 score, 79 stars, 256 scripts, 1.2k downloads, 4 mentions, 61 exports, 121 dependencies

Last updated 1 month ago from: ac5e73db00. Checks: OK: 7, NOTE: 2. Indexed: yes.

Target | Result | Date
Doc / Vignettes | OK | Nov 06 2024
R-4.5-win-x86_64 | NOTE | Nov 06 2024
R-4.5-linux-x86_64 | NOTE | Nov 06 2024
R-4.4-win-x86_64 | OK | Nov 06 2024
R-4.4-mac-x86_64 | OK | Nov 06 2024
R-4.4-mac-aarch64 | OK | Nov 06 2024
R-4.3-win-x86_64 | OK | Nov 06 2024
R-4.3-mac-x86_64 | OK | Nov 06 2024
R-4.3-mac-aarch64 | OK | Nov 06 2024

Exports: addGhostVars, addNoise, argus_microaggregation, argus_rankswap, calcRisks, createDat, createNewID, createSdcObj, dataGen, dRisk, dRiskRMD, dUtility, extractManipData, freq, freqCalc, generateStrata, get.sdcMicroObj, globalRecode, groupAndRename, IL_correl, IL_variables, importProblem, indivRisk, infoLoss, kAnon, kAnon_violations, ldiversity, LocalRecProg, localSupp, localSuppression, mafast, measure_risk, mergeHouseholdData, microaggregation, microaggrGower, modRisk, mvTopCoding, nextSdcObj, plot.localSuppression, plotMicro, pram, print, rankSwap, readMicrodata, recordSwap, removeDirectID, report, riskyCells, sdcApp, selectHouseholdData, set.sdcMicroObj, show, shuffle, strataVar<-, suda2, topBotCoding, undolast, valTable, varToFactor, varToNumeric, writeSafeFile

Dependencies: abind, backports, base64enc, bit, bit64, boot, broom, bslib, cachem, car, carData, class, cli, clipr, cluster, colorspace, commonmark, cowplot, cpp11, crayon, crosstalk, data.table, DEoptimR, Deriv, digest, doBy, dplyr, DT, e1071, evaluate, fansi, farver, fastmap, fontawesome, forcats, Formula, fs, generics, ggplot2, glue, gtable, haven, highr, hms, htmltools, htmlwidgets, httpuv, isoband, jquerylib, jsonlite, knitr, labeling, laeken, later, lattice, lazyeval, lifecycle, lme4, lmtest, magrittr, MASS, Matrix, MatrixModels, memoise, mgcv, microbenchmark, mime, minqa, modelr, munsell, nlme, nloptr, nnet, numDeriv, pbkrtest, pillar, pkgconfig, prettydoc, prettyunits, progress, promises, proxy, purrr, quantreg, R6, ranger, rappdirs, RColorBrewer, Rcpp, RcppEigen, readr, rhandsontable, rlang, rmarkdown, robustbase, sass, scales, shiny, shinyBS, sourcetools, sp, SparseM, stringi, stringr, survival, tibble, tidyr, tidyselect, tinytex, tzdb, utf8, vcd, vctrs, VIM, viridisLite, vroom, withr, xfun, xtable, yaml, zoo

Targeted Record Swapping

Rendered from recordSwapping.Rmd using knitr::rmarkdown on Nov 06 2024.

Last update: 2024-02-18
Started: 2022-02-22

Using the interactive GUI - sdcApp

Rendered from sdcMicro.Rmd using knitr::rmarkdown on Nov 06 2024.

Last update: 2019-10-30
Started: 2018-03-23

Readme and manuals

Help Manual

Help page | Topics
addGhostVars | addGhostVars
Adding noise to perturb data | addNoise
argus_microaggregation | argus_microaggregation
argus_rankswap | argus_rankswap
Recompute Risk and Frequencies for a sdcMicroObj | calcRisks
Small Artificial Data set | casc1
Census data set | CASCrefmicrodata
Dummy Dataset for Record Swapping | createDat
Creates new randomized IDs | createNewID
Fast generation of synthetic data | dataGen
Distribute number of swaps | distributeDraws_cpp
Distribute | distributeRandom_cpp
overall disclosure risk | dRisk
RMD based disclosure risk | dRiskRMD
Data-Utility measures | dUtility
EIA data set | EIA
Remove certain variables from the data set inside a sdc object. | extractManipData
data from the casc project | francdat
Demo data set from mu-Argus | free1
Freq | freq
Frequencies calculation for risk estimation | freqCalc
Generate one strata variable from multiple factors | generateStrata
get.sdcMicroObj | get.sdcMicroObj
Global Recoding | globalRecode
Join levels of a variable in an object of class 'sdcMicroObj-class' or 'factor' or 'data.frame' | groupAndRename
Additional Information-Loss measures | IL_correl IL_variables print.il_correl print.il_variables
importProblem | importProblem
Individual Risk computation | indivRisk
Calculate information loss after targeted record swapping | infoLoss
'kAnon_violations' | kAnon_violations kAnon_violations,sdcMicroObj,logical,numeric-method
Local recoding via Edmond's maximum weighted matching algorithm | LocalRecProg
Local Suppression | localSupp
Local Suppression to obtain k-anonymity | kAnon localSuppression
Fast and Simple Microaggregation | mafast
Disclosure Risk for Categorical Variables | ldiversity measure_risk print.ldiversity print.measure_risk
Replaces the raw household-level data with the anonymized household-level data in the full dataset for anonymization of data with a household structure (or other hierarchical structure). Requires a matching household ID in both files. | mergeHouseholdData
Microaggregation | microaggregation
Microaggregation for numerical and categorical key variables based on a distance similar to the Gower Distance | microaggrGower
microData | microData
Global risk using log-linear models. | modRisk
Detection and winsorization of multivariate outliers | mvTopCoding
nextSdcObj | nextSdcObj
Reorder data | orderData_cpp
Plots for localSuppression objects | plot.localSuppression
Plot functions for objects of class sdcMicroObj | plot.sdcMicroObj
Comparison plots | plotMicro
Post Randomization | pram
Print method for objects from class freqCalc. | print.freqCalc
Print method for objects from class indivRisk | print.indivRisk
Print method for objects from class localSuppression | print.localSuppression
Print method for objects from class micro | print.micro
Print method for objects from class modrisk | modrisk print.modrisk
Print method for objects from class pram | print.pram
Print and Extractor Functions for objects of class 'sdcMicroObj-class' | print,sdcMicroObj-method print.sdcMicroObj
Print method for objects from class suda2 | print.suda2
Random Sampling | randSample_cpp
Rank Swapping | rankSwap
readMicrodata | readMicrodata
Targeted Record Swapping | recordSwap recordSwap.default recordSwap.sdcMicroObj
Targeted Record Swapping | recordSwap_cpp
Remove certain variables from the data set inside a sdc object. | removeDirectID
Generate an Html-report from an sdcMicroObj | report
riskyCells | riskyCells
Random sample for donor records | sampleDonor_cpp
sdcApp | sdcApp
Class '"sdcMicroObj"' | createSdcObj sdcMicroObj-class strataVar<- strataVar<-,sdcMicroObj,characterOrNULL-method undolast
Creates a household level file from a dataset with a household structure. | selectHouseholdData
set.sdcMicroObj | set.sdcMicroObj
Define Swap-Levels | setLevels_cpp
Calculate Risk | setRisk_cpp
Show | show,sdcMicroObj-method
Shuffling and EGADP | shuffle
subsetMicrodata | subsetMicrodata
Suda2: Detecting Special Uniques | suda2
Summary method for objects from class freqCalc | summary.freqCalc
Summary method for objects from class micro | summary.micro
Summary method for objects from class pram | summary.pram
Tarragona data set | Tarragona
A real-world data set on household income and expenditures | testdata testdata2
Top and Bottom Coding | topBotCoding
Comparison of different microaggregation methods | valTable
Change a keyVariable of an object of class 'sdcMicroObj-class' from Numeric to Factor or from Factor to Numeric | varToFactor varToNumeric
writeSafeFile | writeSafeFile