Abstract: Privacy Preserving Data Mining(PPDM) is a rising field of research in Data Mining and various approaches are being introduced by the researchers. One of the approaches is a sanitization process, that transforms the source database into a modified one by removing selective items so that the counterparts or adversaries cannot extract the hidden patterns from. This study address this concept and proposes a revised Item-based Maxcover Algorithm(IMA) which is aimed at less information loss in the large databases with minimal removal of items.
Keywords: Privacy Preserving Data Mining, Restrictive Patterns, Sensitive Transactions, Maxcover, Sanitized database.
[1] Verykios,V.S, Bertino.E, Fovino.I.N, ProvenzaL.P, Saygin.Y and Theodoridis.Y, "State-of-the-art in Privacy Preservation Data Mining", New York,ACM SIGMOD Record, vol.33, no.2, pp.50-57,2004.
[2] Atallah.M, Bertino.E, Elmagarmid.A, Ibrahim.M and Verykios.V.S, "Disclosure Limitation of Sensitive Rules", In Proc. of IEEE Knowledge and Data Engineering Workshop, pages 45–52, Chicago, Illinois, November 1999.
[3] Clifton.C and Marks.D, " Security and Privacy Implications of Data Mining", In Workshop on Data Mining and Knowledge Discovery, pages 15–19, Montreal, Canada, February 1996.
[4] Dasseni.E, Verykios.V.S, Elmagarmid.A.K and Bertino.E, " Hiding Association Rules by Using Confidence and Support", In Proc. of the 4th Information Hiding Workshop, pages 369– 383, Pittsburg, PA, April 2001.
[5] Saygin.Y, Verykios.V.S, and Clifton.C, "Using Unknowns to Prevent Discovery of Association Rules", SIGMOD Record, 30(4):45–54, December 2001.
[6] Oliveira.S.R.M and Zaiane.O.R, "Privacy preserving Frequent Itemset Mining", in the Proc. of the IEEE ICDM Workshop on Privacy, Security, and Data Mining, Pages 43-54, Maebashi City, Japan, December 2002.
[7] Oliveira.S.R.M and Zaiane.O.R, "An Efficient One-Scan Sanitization for Improving the Balance between Privacy and Knowledge Discovery", Technical Report TR 03-15, June 2003.
[8] Cynthia Selvi P, Mohamed Shanavas A.R, "An Effective Heuristic Approach for Hiding Sensitive Patterns in Databases", IOSR-Journal on Computer Engineering, Volume 5, Issue 1(Sep-Oct, 2012), PP 06-11, DOI. 10.9790/0661-0510611.
[9] Agrawal R, Imielinski T, and Swami.A, "Mining association rules between sets of items in large databases", Proceedings of 1993 ACM SIGMOD international conference on management of data, Washington, DC; 1993. p. 207-16.
[10] http://fimi.cs.helsinki.fi/data/