
International Journal of Trend in Scientific Research and Development (IJTSRD)

International Open Access Journal | www.ijtsrd.com

ISSN No: 2456 - 6470 | Volume - 2 | Issue – 6 | Sep – Oct 2018

NOSQL Database Engines for Big Data Management
Mrs. Yasmeen
Assistant Professor, Department of Computer Science and Engineering,
SSM College of Engineering and Technology, Baramulla, Jammu and Kashmir, India

ABSTRACT
We live in a digital world, and the last two decades have seen a significant expansion of information on the internet. The Internet of Things (IoT), one of the most popular terms today, refers to computers, mobile phones and physical devices such as sensors being connected to the internet. With the rapid reach of the internet, it is important to focus on technological advancements for managing huge amounts of data with easy access.

Keywords: Sensor, IOT, NOSQL

I. INTRODUCTION
A database is a collection of data items that provides an organizational structure for information storage. A database also provides a mechanism for querying, creating, modifying and deleting data. A list can also be used to store data, but redundancy is a major issue in a list. A database can store relationships and data that are more complicated than a simple list, with little or no redundancy. A relational database stores data in tables. Normally a table is based on one information theme. For example, an employee list can be divided into a manager table, an intern table and a junior staff table. A table is a two-dimensional grid of data that contains columns and rows. The convention in the relational database world is that columns represent different attributes of an entity and each row represents an instance of the entity.
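The table convention described in the introduction can be illustrated with a minimal sketch using Python's built-in sqlite3 module. The table name, column names and sample rows are assumptions chosen for illustration and do not come from the paper.

```python
import sqlite3

# In-memory relational database; the schema below is an illustrative assumption.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE manager (emp_id INTEGER PRIMARY KEY, name TEXT, department TEXT)"
)
conn.executemany(
    "INSERT INTO manager (emp_id, name, department) VALUES (?, ?, ?)",
    [(1, "Asha", "Sales"), (2, "Bilal", "Support")],
)

# The column names are the attributes of the entity.
columns = [row[1] for row in conn.execute("PRAGMA table_info(manager)")]

# Each fetched row is one instance of the entity.
rows = conn.execute(
    "SELECT emp_id, name, department FROM manager ORDER BY emp_id"
).fetchall()
```

Here `columns` holds the attribute names and `rows` holds one tuple per manager, which mirrors the two-dimensional grid of columns and rows.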

Figure: A Database System (components shown: Database Application, Database Management System, Database)

Conceptually, a database is one component of a database system. Besides the database, a database system consists of database users, database applications and a Database Management System (DBMS). Database users need not always be human; it is possible, for example, for other software programs to be users of the database. Users interact with a database application, and the application in turn depends on the DBMS to extract and store data in the database. The DBMS acts as a gatekeeper: all information flowing in or out of the database must pass through it. It is a critical mechanism for maintaining the quality of the data and the database. Users and database applications are not allowed to interact with the database directly. The DBMS is thus an intermediary between database applications and the database. The DBMS creates and manages the database. A DBMS can be categorized by its data model. Relational Database Management Systems (RDBMS) [50] use the relational data model given by Dr. E. F. Codd. An RDBMS maintains data in tables, together with the relationships created among data and tables. The database is divided into tables, which are connected through a "key field". The RDBMS is the best-known and most widely used database model.

Over the last four decades, RDBMS have remained a key technology for storing structured data. But with the growing size of data, companies need modern technologies to maintain and process it. RDBMS are not well suited to large data volumes with varying data types. They also have scalability problems and often result

@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 6 | Sep-Oct 2018 Page: 617
in failure while performing distributed sharding. Oracle Real Application Clusters (RAC) is a relational database cluster that provides high availability, reliability and performance. MySQL Cluster is another example in which a relational database scales on a large cluster. RDBMS satisfy the ACID (Atomicity, Consistency, Isolation and Durability) properties defined by Jim Gray in the late 1970s. Consistency is a bottleneck for the scalability of relational databases. RDBMS follow a strict data model and cannot violate the ACID properties. That is why NoSQL data stores were developed to address the challenges of traditional databases.

II. NOSQL DATABASES
In computing systems, a huge amount of data comes from the web every day. A large part of this data is handled by relational database management systems (RDBMS). The idea of the relational model came with E. F. Codd's 1970 paper "A relational model of data for large shared data banks", which made data modelling and application programming much easier. Beyond these benefits, the relational model is also well suited to client-server programming, and today it is the predominant technology for storing structured data in web and business applications. Applications also grow with time and pose challenging demands for data management. As stated by Jim Gray, the most challenging part is to understand the data, find patterns, trends and anomalies, and extract the relevant information. With the advent of Web 2.0 applications, data stores needed to scale to OLTP/OLAP-style application loads in which millions of users read and update information, in contrast to traditional data stores. Such data stores provide good horizontal scalability for simple read/write operations distributed over many servers. Relational database systems have little capability to scale horizontally to these levels. This paved the way for alternative solutions in scenarios where relational database systems proved not to be the right choice. NOSQL databases are growing fast and are the best choice for handling the problem popularly known as Big Data and for supporting Business Intelligence in organisations. Today we need rich mobile apps that are highly available, very responsive and not affected by network availability. NOSQL (mobile) databases are the best solution for developing such modern mobile apps. NOSQL covers a wide variety of database technologies that came into existence in response to the demands of building modern applications.

The mobile world is one of the most dynamic areas of Information Technology today. Smart phones and tablets have created a huge market for mobile applications. Consequently there is an increasing demand for mobile application developers. Almost all mobile applications require a persistent data layer, including options for queries, so the interest of database professionals, academics and researchers in mobile technologies is increasing. The NOSQL approach is a strong competitor to the relational model because it supports high scalability. The well-known CAP theorem states that no distributed database system can provide all three attributes (consistency, availability and partition tolerance) at once; only two of the three are possible. Relational databases typically favour consistency and availability, whereas many NOSQL databases favour the last two, availability and partition tolerance, to achieve high availability and partitioning of data.

Types:
There are three main types of NoSQL data stores: Key-Value data stores, Extensible Record data stores and Document data stores.
• In Key-Value data stores, values are indexed by keys, and the data model follows that of the well-known memcached distributed in-memory cache. Examples include Project Voldemort, Riak and Redis.
• Document data stores retrieve, manage and store semi-structured data. They support multiple forms of documents (objects). Values are stored in documents as lists or nested documents. Examples are MongoDB, SimpleDB and CouchDB.
• Extensible Record data stores are motivated by Google's Bigtable. They have a flexible data model with rows and columns, and rows and columns can be split over multiple nodes. Examples are HBase, Hypertable and PNUTS.

III. DOCUMENT DATA STORES
Document-oriented data stores are designed to store, retrieve and manage semi-structured data. They support multiple types of documents (objects) per data store. "Documents" save values as nested documents or lists. These documents can be of any type, ranging from PDF and Word documents to XML, HTML, etc. SimpleDB, CouchDB and MongoDB are a few examples of document-oriented data stores.

MongoDB: MongoDB is an open source, document-oriented datastore that is written in C++. It is

developed by 10gen (now MongoDB Inc.) for a wide variety of real-time applications. It also provides full index support for collections of documents. MongoDB has a well-structured document query mechanism. The next few subsections discuss different aspects of MongoDB design.

NoSQL data stores are well suited to data of high velocity and volume. MongoDB is a scalable, high-performance NoSQL datastore. It is an agile datastore that allows schemas to change quickly as applications evolve, and it provides rich querying capabilities. MongoDB is a real-time datastore usually used for online data, but it also finds applicability in a wide variety of industries. The MongoDB package contains different tools, and its components differ depending on the operating system. Mongod, mongo and mongos are the core processes of the MongoDB package. Mongod is responsible for the database, whereas mongos is for sharded clusters. Mongo is the interactive shell, or client. For the Windows environment there are specific services such as mongod.exe and mongos.exe.

Tools for data and binary import/export are also part of the MongoDB package. The different MongoDB tools are depicted in the following figure. Mongod is the primary daemon process of the MongoDB system. It handles data requests, manages the data format and executes background management operations. A datastore is a physical container for collections. Each datastore gets its own set of files on the file system. A single MongoDB server typically has multiple datastores. Unlike Extensible Record stores such as HBase, MongoDB does not require a separate distributed file system to run. A collection is a group of MongoDB documents and is the equivalent of an RDBMS table. Collections do not enforce any type of schema, so documents within a collection can have different fields. Normally, all documents in a collection serve a similar or related purpose. Inside one collection, the user can have any number of documents. A document has a JavaScript Object Notation (JSON) structure that stores a set of key/value pairs.

Figure: MongoDB Package Components
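The collection and document model described for MongoDB can be mimicked in plain Python. The following is a minimal sketch that needs no running server: the collection is just a list of dictionaries, the sample data is invented, and the `find` helper is a hypothetical stand-in for illustration, not the MongoDB query API.

```python
import json

# A "collection" modelled as a plain list of dict documents (an assumption for
# illustration; a real MongoDB collection is managed by the mongod process).
employees = [
    {"_id": 1, "name": "Asha", "role": "manager", "reports": [2, 3]},
    {"_id": 2, "name": "Bilal", "role": "intern", "mentor": {"_id": 1, "name": "Asha"}},
    {"_id": 3, "name": "Chen", "role": "junior staff"},
]


def find(collection, **criteria):
    """Return documents whose top-level fields equal every given criterion."""
    return [
        doc
        for doc in collection
        if all(doc.get(field) == value for field, value in criteria.items())
    ]


interns = find(employees, role="intern")

# Documents in one collection need not share the same set of fields.
field_sets = [sorted(doc) for doc in employees]

# Each document serialises naturally to JSON key/value pairs.
as_json = json.dumps(employees[1], sort_keys=True)
```

The three documents serve a related purpose but carry different fields: one holds a list, another a nested document, and the third neither, which is the schema flexibility the text attributes to collections.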

IV. EXTENSIBLE RECORD DATA STORES
Google's Bigtable is the motivation for extensible record database engines. It has a flexible data model with rows and columns that can be extended at any time. Apache HBase, Apache Accumulo and Hypertable are a few of the well-known Extensible Record stores. Extensible Record stores are scalable, and both rows and columns can be split over multiple nodes. Extensible Record stores are often termed Column-Oriented data stores.
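The flexible row and column model can be sketched with nested dictionaries. This is a toy in-memory model under stated assumptions, not the API of any of the stores named above: the `put` and `get` helpers are hypothetical, and the "family:qualifier" column names are only illustrative.

```python
from collections import defaultdict

# table[row_key][column_name] -> value. Only cells that exist are stored,
# so a sparse row costs nothing for the columns it does not have.
table = defaultdict(dict)


def put(row_key, column, value):
    """Write one cell; the column does not have to be declared beforehand."""
    table[row_key][column] = value


def get(row_key, column, default=None):
    """Read one cell, returning default when the row or column is absent."""
    return table.get(row_key, {}).get(column, default)


put("user#1", "info:name", "Asha")
put("user#1", "info:email", "asha@example.org")
put("user#2", "info:name", "Bilal")
# A new column can appear at any time without altering existing rows.
put("user#2", "stats:logins", 7)
```

After these writes, the two rows hold different sets of columns, and reading a column that a row never stored simply yields the default value.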

HBase: HBase is a Column-Oriented data store that runs on top of HDFS. HBase is an open source Apache project that can be summarized as a distributed, fault-tolerant, scalable data store. It is good at managing sparse data sets. Unlike a relational database management system (RDBMS), it does not support a structured query language such as SQL. In fact, HBase is not a relational database at all. HBase is written in Java, much like a typical Hadoop application, but it does not use MapReduce. HBase applications can also be written using the Avro, REST and Thrift APIs. An HBase system is made up of a set of tables, which are stored in HDFS. Each table contains rows and columns, much like a traditional database. Each table has a column defined as its primary key, and all calls to access the table must use the primary key. The HBase architecture has three layers, namely the client layer, the server layer and the storage layer. The client layer provides an interface to the user. It has a client library that is used to communicate with the HBase installation. The storage layer has a coordination system and a file system. HDFS is the most commonly used file system for HBase, and Apache ZooKeeper is used as its coordination service. A master server and the region servers are the two components of the server layer. The following figure gives an overview of the HBase architecture.

Figure: HBase Architecture
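Because every table access goes through the primary (row) key, a client needs some way to find the region server responsible for a given key. The paper does not describe this step; the sketch below assumes that each region serves a contiguous, sorted range of row keys. The split points, server names and the `locate` helper are all hypothetical.

```python
from bisect import bisect_right

# Assumed layout: sorted split points divide the key space into regions,
# and each region is served by one server. All values are made up.
split_points = ["g", "n", "t"]                  # start keys of regions 2, 3, 4
region_servers = ["rs1", "rs2", "rs3", "rs4"]   # one server per region


def locate(row_key):
    """Return the server whose key range contains row_key."""
    return region_servers[bisect_right(split_points, row_key)]
```

With this layout, `locate("alice")` falls before the first split point and maps to `rs1`, while a key equal to a split point such as `"g"` belongs to the region that starts there. A binary search keeps the lookup cheap even when a table has many regions.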

V. KEY-VALUE DATA STORES
Key-value: Key-value data stores are essentially big hash tables with a unique primary key and a pointer to a particular data item. Their data model is identical in design to the memcached in-memory cache. Keys can be primitive types or objects, and values are accessed only by key. These data stores support functionality such as replication, partitioning, locking, versioning, transactions and/or other features. They are extremely useful in building specialized applications with very fast query capabilities. Cassandra, Redis, Riak, Scalaris and Project Voldemort are a few examples of key-value data stores.

Cassandra: Cassandra is a distributed, highly scalable and fault-tolerant NoSQL datastore. It is a structured store with a decentralized architecture. It was developed by Facebook Inc., and its first release came out in 2008. The main aim in developing Cassandra was to meet the storage requirements of the Inbox Search problem. For this purpose, Facebook needed a datastore with very high write throughput. Apache Cassandra is an open source project under the Apache License 2.0.

In traditional databases that can be deployed over multiple nodes, and even in data stores such as HBase and Google's Bigtable, a master/slave relationship exists between the nodes. The master is the authority for distributing and managing data, while the slaves synchronize their data with the master. All writes pass through the master, which makes it a single point of failure. Architectures with a master/slave setup can therefore be adversely affected if the master node fails.

By contrast, Cassandra was designed with the understanding that failures can and do occur. It has a peer-to-peer distribution model. The data is divided among all nodes in the cluster. All nodes are structurally identical, so there is no master node. This equality among nodes improves the general availability of the datastore. It also makes scaling up and scaling down much easier, because a new node is not treated differently from existing ones. The following figure describes Cassandra read repair.
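One common way to divide data among structurally identical peers is a consistent-hash ring. The paper only says that the data is divided among all nodes, so the sketch below is an assumption about how such a division can work, with one token per node for simplicity. The `Ring` class, node names and keys are hypothetical.

```python
import hashlib
from bisect import bisect_right


def token(value):
    """Map a string onto the ring as an integer token."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class Ring:
    """Peers placed on a hash ring; no node plays a special master role."""

    def __init__(self, nodes):
        self._ring = sorted((token(n), n) for n in nodes)

    def add(self, node):
        self._ring = sorted(self._ring + [(token(node), node)])

    def owner(self, key):
        """Return the first node clockwise from the key's token, wrapping around."""
        tokens = [t for t, _ in self._ring]
        index = bisect_right(tokens, token(key)) % len(self._ring)
        return self._ring[index][1]


ring = Ring(["node-a", "node-b", "node-c"])
keys = [f"user:{i}" for i in range(100)]
before = {k: ring.owner(k) for k in keys}

ring.add("node-d")  # scale up: the new peer is treated like any other
after = {k: ring.owner(k) for k in keys}
moved = [k for k in keys if before[k] != after[k]]
```

Any key listed in `moved` now belongs to `node-d`; keys owned by the other peers stay where they were, which is why adding or removing a node in such a scheme disturbs only a portion of the data.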

VI. CONCLUSION
Big Data is a very popular term today, characterized by the 3Vs: volume, variety and velocity of data. This paper focuses on the new breed of databases commonly known as NOSQL databases and on how they are beneficial for managing huge volumes of data.

Figure: Read Repair of Cassandra
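The paper presents read repair only as a figure. As a rough sketch of the idea, assume each replica stores a timestamp with every value and the newest timestamp wins: a read collects the versions held by the replicas, returns the newest one, and writes it back to any replica that is stale or missing the key. The replica contents and the `read_with_repair` helper are hypothetical, and the sketch ignores networking and consistency levels.

```python
# Each replica maps key -> (timestamp, value). All data here is illustrative.
replicas = [
    {"user:1": (10, "old@example.org")},
    {"user:1": (12, "new@example.org")},
    {},  # this replica missed the write entirely
]


def read_with_repair(key):
    """Return the newest value and push it to replicas that are out of date."""
    versions = [replica[key] for replica in replicas if key in replica]
    if not versions:
        return None
    newest = max(versions)  # tuples compare by timestamp first
    for replica in replicas:
        if replica.get(key) != newest:
            replica[key] = newest  # repair the stale or missing copy
    return newest[1]


value = read_with_repair("user:1")
```

After the call, all three replicas hold the version with timestamp 12, so a later read from any one of them returns the same value.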

