“Secure” Log-Linear and logistic regression analysis of distributed databases

Stephen E. Fienberg, William J. Fulp, Aleksandra B. Slavkovic, Tracey A. Wrobel

Research output: Chapter in Book/Report/Conference proceedingConference contribution

17 Citations (Scopus)

Abstract

The machine learning community has focused on confidentiality problems associated with statistical analyses that “integrate” data stored in multiple, distributed databases where there are barriers to simply integrating the databases. This paper discusses various techniques which can be used to perform statistical analysis for categorical data, especially in the form of log-linear analysis and logistic regression over partitioned databases, while limiting confidentiality concerns. We show how ideas from the current literature that focus on “secure” summations and secure regression analysis can be adapted or generalized to the categorical data setting.

Original languageEnglish (US)
Title of host publicationPrivacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings
EditorsLuisa Franconi, Josep Domingo-Ferrer
PublisherSpringer Verlag
Pages277-290
Number of pages14
ISBN (Print)9783540493303
StatePublished - Jan 1 2006
EventCENEX-SDC Project of International Conference on Privacy in Statistical Databases, PSD2006 - Rome, Italy
Duration: Dec 13 2006Dec 15 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4302
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

OtherCENEX-SDC Project of International Conference on Privacy in Statistical Databases, PSD2006
CountryItaly
CityRome
Period12/13/0612/15/06

Fingerprint

Distributed Databases
Nominal or categorical data
Confidentiality
Logistic Regression
Linear regression
Regression Analysis
Regression analysis
Logistics
Summation
Statistical Analysis
Machine Learning
Limiting
Integrate
Learning systems
Statistical methods
Community
Form

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Fienberg, S. E., Fulp, W. J., Slavkovic, A. B., & Wrobel, T. A. (2006). “Secure” Log-Linear and logistic regression analysis of distributed databases. In L. Franconi, & J. Domingo-Ferrer (Eds.), Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings (pp. 277-290). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4302). Springer Verlag.
Fienberg, Stephen E. ; Fulp, William J. ; Slavkovic, Aleksandra B. ; Wrobel, Tracey A. / “Secure” Log-Linear and logistic regression analysis of distributed databases. Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings. editor / Luisa Franconi ; Josep Domingo-Ferrer. Springer Verlag, 2006. pp. 277-290 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{d14155ce024649c2abf7a32f500ca367,
title = "“Secure” Log-Linear and logistic regression analysis of distributed databases",
abstract = "The machine learning community has focused on confidentiality problems associated with statistical analyses that “integrate” data stored in multiple, distributed databases where there are barriers to simply integrating the databases. This paper discusses various techniques which can be used to perform statistical analysis for categorical data, especially in the form of log-linear analysis and logistic regression over partitioned databases, while limiting confidentiality concerns. We show how ideas from the current literature that focus on “secure” summations and secure regression analysis can be adapted or generalized to the categorical data setting.",
author = "Fienberg, {Stephen E.} and Fulp, {William J.} and Slavkovic, {Aleksandra B.} and Wrobel, {Tracey A.}",
year = "2006",
month = "1",
day = "1",
language = "English (US)",
isbn = "9783540493303",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "277--290",
editor = "Luisa Franconi and Josep Domingo-Ferrer",
booktitle = "Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings",
address = "Germany",

}

Fienberg, SE, Fulp, WJ, Slavkovic, AB & Wrobel, TA 2006, “Secure” Log-Linear and logistic regression analysis of distributed databases. in L Franconi & J Domingo-Ferrer (eds), Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4302, Springer Verlag, pp. 277-290, CENEX-SDC Project of International Conference on Privacy in Statistical Databases, PSD2006, Rome, Italy, 12/13/06.

“Secure” Log-Linear and logistic regression analysis of distributed databases. / Fienberg, Stephen E.; Fulp, William J.; Slavkovic, Aleksandra B.; Wrobel, Tracey A.

Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings. ed. / Luisa Franconi; Josep Domingo-Ferrer. Springer Verlag, 2006. p. 277-290 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4302).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - “Secure” Log-Linear and logistic regression analysis of distributed databases

AU - Fienberg, Stephen E.

AU - Fulp, William J.

AU - Slavkovic, Aleksandra B.

AU - Wrobel, Tracey A.

PY - 2006/1/1

Y1 - 2006/1/1

N2 - The machine learning community has focused on confidentiality problems associated with statistical analyses that “integrate” data stored in multiple, distributed databases where there are barriers to simply integrating the databases. This paper discusses various techniques which can be used to perform statistical analysis for categorical data, especially in the form of log-linear analysis and logistic regression over partitioned databases, while limiting confidentiality concerns. We show how ideas from the current literature that focus on “secure” summations and secure regression analysis can be adapted or generalized to the categorical data setting.

AB - The machine learning community has focused on confidentiality problems associated with statistical analyses that “integrate” data stored in multiple, distributed databases where there are barriers to simply integrating the databases. This paper discusses various techniques which can be used to perform statistical analysis for categorical data, especially in the form of log-linear analysis and logistic regression over partitioned databases, while limiting confidentiality concerns. We show how ideas from the current literature that focus on “secure” summations and secure regression analysis can be adapted or generalized to the categorical data setting.

UR - http://www.scopus.com/inward/record.url?scp=84958974400&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84958974400&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9783540493303

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 277

EP - 290

BT - Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings

A2 - Franconi, Luisa

A2 - Domingo-Ferrer, Josep

PB - Springer Verlag

ER -

Fienberg SE, Fulp WJ, Slavkovic AB, Wrobel TA. “Secure” Log-Linear and logistic regression analysis of distributed databases. In Franconi L, Domingo-Ferrer J, editors, Privacy in Statistical Databases - CENEX-SDC Project International Conference, PSD 2006, Proceedings. Springer Verlag. 2006. p. 277-290. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).