Anooja Ali
Published: 2024-09-12
Total Pages: 219
Managing Unstructured Data: NoSQL Database Essentials is a reference book and a teaching and reading guide for college faculty and students. Chapter 1 discusses the fundamentals of databases and relational databases. It helps students understand data management concepts through data modelling, schema design, data storage, and retrieval. These foundational skills apply across industries and provide a stepping stone for further specialization and career development. Chapter 2 is about unstructured data. Structured, semi-structured, and unstructured data represent different levels of organization and complexity, and each calls for different methods of storage, management, and analysis. The chapter helps students understand the transition from structured to unstructured data in terms of data management and analysis, a pivotal aspect of modern data management. Chapter 3 introduces NoSQL databases and highlights their major differences from SQL and relational databases. It explains what drives the adoption of NoSQL: flexible schemas, scalability, high performance, and support for distributed architectures. Chapter 4 covers NoSQL, or "Not Only SQL", databases, a diverse set of database technologies designed to address specific challenges not well served by traditional relational databases. It gives a brief overview of the main types of NoSQL databases and presents the four basic data models: key-value, document-oriented, columnar, and graph-based. Chapter 5 describes popular NoSQL database technologies, including Apache HBase, Apache CouchDB, Neo4j, and Apache Cassandra, and compares them.
Chapter 5 also covers distributed architecture with fault tolerance, high availability, and disaster recovery capabilities for ensuring data integrity and business continuity. Chapter 6 gives an overview of MongoDB, a document-oriented NoSQL database known for its flexibility, scalability, and ease of use. Its features, including the document store, the MongoDB protocol, horizontal scalability, cross-platform compatibility, replication, and sharding, are also covered. Chapter 7 deals with concurrency control in databases. It discusses methods for achieving concurrency first in structured data and then in unstructured data, the challenges of concurrency control for unstructured data, transaction commits, and the different isolation levels. Chapter 8 discusses how unstructured data is used in big data processing. It covers query processing performance evaluation in big data systems and the types of dirty data. Data cleansing is explained in detail, including the steps in cleansing, exploratory data analysis, and data visualization. We hope that Managing Unstructured Data: NoSQL Database Essentials will serve as a handy and useful reference for teachers and students working with unstructured data.