You are viewing a javascript disabled version of the site. Please enable Javascript for this site to function properly.
Go to headerGo to navigationGo to searchGo to contentsGo to footer
In content section. Select this link to jump to navigation

Collecting and managing fuzzy data in statistical relational databases

Abstract

Statistical institutes are focusing on variety of data sources from traditional surveys to big-data. Many of these data and concepts can be expressed as crisp values. But many other data cannot be expressed by precise values. In order to collect, store and manage the fuzziness in data we have adapted the fuzzy meta model as an extension of traditional relational database. Furthermore, experts' knowledge often contains vagueness and subjectivity. If we store this knowledge in a fuzzy database we can build knowledge management systems capable to cope with fuzziness. Statistical institutes cooperate in the data exchange. We have briefly discussed a simple way of extending the SDMX standard to cope with the fuzzy data in a way that does not influence exchanging precise values. Our research was focused on examining promising ways for managing fuzziness of real world because statistical institutes have been starting to analyze variety of promising data sources where not all data are always precise.

References

[1] 

Ahas R., , Tiru M., , Saluveer E., and Demunter C., Mobile telephones and mobile positioning data as source for statistics: Estonian experiences. Proc New Techniques and Technologies for Statistics (NTTS). Brussels; (2011) .

[2] 

Altin L., , Tiru M., , Saluveer E., and Puura A., Using passive mobile positioning data in tourism and population statistics. Proc New Techniques and Technologies for Statistics (NTTS). Brussels; (2015) .

[3] 

Balbi S., , Hudec M., , Juriová J., , Klúčik M., , Stawinoga A., and Triunfo N., Report on Analysis of Existing Practices in the Data Collection Field, Deliverable 5.2, Blue-Ets Project (FP7); (2013) .

[4] 

Becker G., and Bruschi M., An SDMX-based unified data catalogue (UDC). Proc UNECE/OECD/Eurostat Meeting on Management of Statistical Information Systems. Dajeon; (2010) .

[5] 

Costanzo L., , Di Bella G., , Hargreaves E., , Pereira H.J., and Rodrigues S., An overview of the use of administrative data for business statistics in Europe. Porc 58th World Statistical Congress. Dublin; (2011) .

[6] 

Daas P., , Roos M., , De Blois C., , Hoekstra R., , Ten Bosch O., and Ma Y., New data sources for statistics: experiences at Statistics Netherlands. The Hague/Heerlen: Statistics Netherlands; (2011) .

[7] 

Date C.J., Date on databases: Writings 2000-2006. New York: Apress; (2006) .

[8] 

De Tre G., , Matthe T., , KordJamshidi P., and Demoor M., On the use of case based reasoning techniques in flexible querying. Proc 18th International Workshop on Database and Expert Systems Applications. Regensburg; (2007) .

[9] 

Eurostat, Introduction to SDMX, Student Book, Eurostat Directorate B. Luxembourg: Eurostat; (2010) .

[10] 

Dunne J., and Hayes J., Realising the statistical potential of administrative data. Proc Seminar on New Frontiers for Statistical Data Collection. Geneva; (2012) .

[11] 

Galindo J., editor. Handbook of research on fuzzy information processing in databases. Hershey: Information Science Reference; (2008) .

[12] 

Galindo J., , Urrutia A., and Piattini M., Fuzzy databases - Modeling, design and implementation. Hershey: Idea Group Publishing; (2006) .

[13] 

Herrera-Viedma E., , Porcel C., , López-Herrera A., and Alonso S., A fuzzy linguistic recommender system to advice research resources in university digital libraries. Studies in fuzziness and soft computing, volume 220: . Springer; (2008) . pp. 567-585.

[14] 

Hudec M., Storing and analysing fuzzy data from surveys by relational databases and fuzzy logic approaches. Proc XXV-th IEEE International Conference on Information, Communication and Automation Technologies. Sarajevo; (2015) .

[15] 

Hudec M., Managing fuzziness of real world in business informatics. Proc Strategic Management and Support by Information Systems. Uherské Hradiště; (2015) .

[16] 

Hudec M., Fuzzy data in traditional relational databases. Proc 12th Symposium on Neural Network Applications in Electrical Engineering. Belgrade; (2014) .

[17] 

Hudec M., Fuzzy database queries in official statistics: Perspective of using linguistic terms in query conditions, Stat J IAOS 29: (4) ((2013) ), 315-323.

[18] 

Hudec M., and Vujošević M., A fuzzy system for municipalities classification, Central European J. Operations Res 18: (2) ((2010) ), 171-180.

[19] 

Klement E.P., , Mesiar R., and Pap E., Triangular norms. Dordrecht: Kluwer Academic Publishers; (2000) .

[20] 

Lu J., , Wu D., , Mao M., , Wang W., and Zhang G., Recommender system application developments: A survey, Decision Support Syst 74: ((2015) ), 12-32.

[21] 

Medina J.M., , Pons O., and M.A. Villa, GEFRED: A generalized model of fuzzy relational databases, Inf Sci 76: (1-2) ((1994) ), 87-109.

[22] 

Meyer A., and H.J. Zimmermann, Applications of fuzzy technology in business intelligence, Int J Comput Communicat Control VI: (3) ((2011) ), 428-441.

[23] 

Pavese F., Why should correction values be better known than the measurand true value? J. of Physics, Conf. series 459; (2013) .

[24] 

Petry F., Fuzzy databases - Principles and applications. Boston: Kluwer; (1996) .

[25] 

Puts M., , Daas P., and Tennekes M., High frequency road sensor data for official statisitics. Proc New Techniques and Technologies for Statistics (NTTS). Brussels; (2015) .

[26] 

Portinale L., and Verrua A., Exploiting Fuzzy-SQL in case-based reasoning. Proc 14th International Florida Artificial Intelligence Research Society Conference (FLAIRS). Key West; (2001) .

[27] 

Prade H., and Testemale C., Fuzzy relational databases: Representational issues and reduction using similarity measures, J. American Society Inf. Sci. 38: (2) ((1987) ), 118-126.

[28] 

Praženka D., and Boško P., Combining technical standards for statistical business processes from end-to-end. Proc New Techniques and Technologies for Statistics (NTTS). Brussels; (2011) .

[29] 

Řkrbić S., Using fuzzy logic in relational databases. Ph.D. Dissertation, University of Novi Sad; (2008) .

[30] 

Řkrbić S., , Racković M., and Takači A., Towards the methodology for development of fuzzy relational database applications, Comput. Sci. Inf. Syst. 8: (1) ((2011) ), 27-40.

[31] 

Torres van Grinsven V., and Snijkers G., Sentiments and perceptions of business respondents on social media: an exploratory analysis, J. Official Statist. 31: (2) ((2015) ), 283-304.

[32] 

Vale S., Using administrative and secondary sources for official statistics - A handbook of principles and practices. Geneva: UNECE; (2011) .

[33] 

Viertl R., Fuzzy data and information systems. Proc WSEAS International Conference on Systems, Corfu; (2011) .

[34] 

Vucetic M., , Hudec M., and Vujošević M., A new method for computing fuzzy functional dependencies in relational database systems, Expert Syst. Appl. 40: (7) ((2013) ), 2738-2745.

[35] 

Yager R.R., Fuzzy logic methods in recommender systems, Fuzzy Sets Syst. 136: (2) ((2003) ), 133-149.

[36] 

Zadeh L.A., From computing with numbers to computing with words - from manipulation of measurements to manipulation of perceptions, in: Computing with words, P. Wang, ed., New York: John Wiley & Sons; (2001) . pp. 35-68.

[37] 

Zadeh L.A., Fuzzy sets, Inf. Control 8: (3) ((1965) ), 338-353.

[38] 

Zimmermann H.J., Fuzzy set theory and its applications, London: Kluwer Academic Publishers; (2001) .