International Journal on Recent and Innovation Trends in Computing and Communication

Volume: 3 Issue: 4

ISSN: 2321-8169
2132 - 2136

_______________________________________________________________________________________________

A Survey paper on Cloud Environment for Backup and Data Storage
Prof. Palash M. Gourshettiwar
Computer Science & Engineering
D.M.I.E.T.R, Sawangi(M), Wardha
Asst. Professor
Email_id-palash9477@gmail.com

Prof. Dhiraj Shirbhate
Computer Science & Engineering
J.D.I.E.T, Yavatmal
Asst. Professor
Email_id-shirbhate.dhiraj@gmail.com

Abstract – The use of the disks of the nodes of a cluster as worldwide stockpiling framework is a reasonable answer for a cloud situation. The
requirement for the accessible of data from anyplace is expanding; this speaks to an issue for some clients who use applicati ons, for example,
databases, media, individual document, records, and so forth. The I/O information requests of these applications get higher as they get bigger. So
as to enhance execution of these applications can utilize parallel document frameworks. PVFS2 is a free parallel record framework grew by a
multi-organization group of parallel I/O, systems administration and capacity specialists. In this overview of the configuration of an execution
for cloud environment for ready to store and move down information by utilizing remote servers that can be gotten to through the Internet. The
execution expects to expand the accessibility of information and lessen in loss of data.
Keywords - Cloud, Storage, backup, File System, Cluster, PVFS2.

__________________________________________________*****_________________________________________________
I.

INTRODUCTION

Because of the development of advances, diverse
administrations and applications that permit clients to
perform assignments that enhance efficiency in their every
day exercises expanded. So the need to get to whenever as
well as from a remote site has developed from a
hypothetical proposition an authentic need. This has offered
ascent to look for different options for take care of this
issue. One of the choices is known by the term distributed
computing; distributed computing can be characterized as
that administration (programming, stage or foundation)
situated on the Internet and is gotten to from a cell phone or
desktop PC, giving clients a wide assortment of utilizations
(databases, center office programming, stockpiling, and so
forth). The latest and acknowledged institutionalized
meaning of Cloud Computing is the one by the National
Institute of Standards and Technology (NIST) [1]
"Distributed computing is a model for empowering
universal, advantageous, on interest system access to an
imparted pool of configurable figuring assets that can be
quickly provisioned and discharged with negligible
administration exertion or administration supplier
association. This cloud model advances accessibility and is
made out of five key qualities, three administration models,
and four sending models." Others are more particular: "A
Cloud is a kind of parallel and conveyed framework
comprising of a gathering of between joined and virtualized
PCs that are rapidly provisioned and exhibited as one or
more brought together processing assets in view of Service
Level Agreements (SLAs) created through transaction
between the administration supplier and customers" [2].

Some Internet administration suppliers like Google, Amazon
AWS, Microsoft and others have constructed their own
particular framework to furnish the client with an open
cloud, this kind of cloud is kept up and worked by outsiders
not related with the association, therefore, both the
methodologies and information of different customers are
blended on servers, stockpiling frameworks and other
foundation of the cloud. For organizations that need high
privacy and information security, a choice are private mists.
This kind of cloud is a decent option. Administration is
conveyed by a customer that controls which applications
ought to run. Servers, system, and capacity gadgets are the
property of the association. So they can choose which
clients are permitted to utilize the base. There is one more
sort of cloud is known of half breed models join open and
private mists. In this model it claims and imparted segments
other in a controlled way. A system model where the
information is moved down and put away by utilizing a web
association on remote servers is known by the name of
distributed storage, by and large are facilitated by third
gatherings. Facilitating organizations work extensive server
farms, and individuals who require their information to be
facilitated purchase or lease stockpiling limit from them.
The server farm directors frequently us virtualization to hold
assets as indicated by client necessities and uncover them as
storage rooms to store records or information objects.
Physically, the asset can stretch out along numerous servers
and various areas. The security of the records relies on the
facilitating organizations, and on the applications that
influence the distributed storage. Numerous applications
work with a lot of information that need to be stacked or put
away on plate, for instance, database, media, individual
2132

IJRITCC | April 2015, Available @ http://www.ijritcc.org

_______________________________________________________________________________________

International Journal on Recent and Innovation Trends in Computing and Communication
Volume: 3 Issue: 4

ISSN: 2321-8169
2132 - 2136

_______________________________________________________________________________________________
documents, and so forth. To enhance execution in plate
access PCs in the mid to high range are utilized I/O
frameworks taking into account SAN. Anyway, considering
the degree and utilization group with a specific end goal to
enhance the execution/expense are likewise being utilized
record frameworks that exploit the capacity hubs in the
bunch. This keeps the framework cost SAN. Among these
document frameworks that permit applications to perceive
how the entire stockpiling framework comprising of plates
of all hubs in the group, are Luster [3] and PVFS2. Both are
parallel record frameworks , that is, numerous hubs can
permit parallel access to the same document and a hub can
get to various bits of a document simultaneously. They
attain to this by circulating a record crosswise over diverse
circles. PVFS is a free parallel document framework for
Linux, now in its second form (PVFS2) that permits
exploiting the thing plates that as of now exist as an
essential piece of every hub in a common minimal effort
group. PVFS can stripes documents over all the I/O servers
to expand information data transfer capacity in parallel
project. This alternative maintains a strategic distance from
the requirement for lavish SAN. Here in this work, a cloud
situation is proposed to reinforcement and store information
on a private cloud utilizing PVFS2 like document
framework for capacity information to expand the execution
of these applications. This alternative permits data/ yield
parallel, so will decrease the entrance times to information.
On the customer end, a multiplatform application is
produced utilizing free programming that permits
information exchange quick and straightforward way.

specified. Yet, it is imperative to know how to really deal
with the reinforcement and capacity of records inside the
data/ yield. Right now there are record frameworks for cloud
situations [9,10, 11]. Panzura CloudFS record System [9] is
a document framework created from no place to give
incorporation cloud and NAS situations. It offers usefulness
straightforward to clients, as everybody can see the same
record from any area. It likewise permits information
imparting, without needing to erase the first record. Cloud
File System Oracle [10] is a document framework for
private cloud situations, intended to oversee broadly useful
record store outside of a prophet database over different
agent framework stages with one administration interface.
Excessively its firmly incorporated with the programmed
stockpiling administration highlights of the prophet
database. BlueSky[11], is a record framework for a cloud
situation; BlueSky utilizes four sorts of articles for speaking
to information and metadata. These items are totaled into
log sections for capacity. BlueSky gives standard POSIX
document framework semantics, including nuclear renames
and hard connections. Likewise utilizes 32 KB hinders
rather than ordinary circle document framework size like 4
KB to lessen overhead. One target of this work is to build up
a multiplatform application that serves as capacity and
reinforcement environment in the cloud, we chose as a
document framework PVFS2 for our cloud surroundings; to
be free and open source, we have the flexibility to utilize
and adjust as indicated by our needs. It likewise offers
accessibility, adaptability and general incredible execution
when composing to or perusing from the I/O servers.

II. RELATED WORKS

III. PVFS2 OVERVIEW

Distributed computing and distributed storage have turned
into the favored system for conveying data and online
usefulness. While some cloud administrations concentrate
on giving purchasers an extensive variety of administrations
and functionalities. Others give distributed storage to
customers to free or charge some sort of membership based
fee[4,5,6,7,8,] as Windows Azure [4], is an open cloud stage
in a worldwide system of server farms run by Microsoft.
Dropbox [6], is a document facilitating administration
worked that
offers distributed storage, record
synchronization, and customer programming. Dropbox
permits clients to make a unique envelope on each of their
PCs, which Dropbox then synchronizes with the goal that it
seems, by all accounts, to be the same organizer paying little
heed to which PC is utilized to view it. Some of them have a
couple of a larger number of highlights than the others, and
synchronize over numerous gadgets or administration of
documents and reinforcements through cell phones. As such,
a few highlights of the application-level cloud and the
advantages to be picked up by paying for them have been

The Parallel Virtual File System task is a multi-organization
community push to plan and execute a creation parallel
record framework for HPC applications [12], [13]. The
second PVFS variant, PVFS2, is an augmentation of the
first that enhances measured quality and adaptability among
modules, and furnishes a solid joining with MPI-IO. The
segments of a disseminated record in PVFS are: N lumps of
document information, one metafile with record qualities,
and one registry entrance. PVFS stripes a solitary record
over the I/O or information servers. Every record will have
N datafiles, one on every information server, with a lump (a
few stripes) of the information in the record. The 64 bits
descriptor used to allude a datafile is a datahandle. The
rundown of all the datahandle of a record and its qualities
are kept up in a metafile on a metadata server. Metafile has
likewise a metahandle that speaks to it. The guardian
registry of the record can be on another metadata server.

2133
IJRITCC | April 2015, Available @ http://www.ijritcc.org

_______________________________________________________________________________________

International Journal on Recent and Innovation Trends in Computing and Communication
Volume: 3 Issue: 4

ISSN: 2321-8169
2132 - 2136

_______________________________________________________________________________________________
IV. DESIGN A CLOUD ENVIRONMENT
In the outlined private cloud for reinforcement and
information stockpiling will create and execute an
application that naturally synchronizes all data moved down
or put away by the client in the virtual organizer to the
cloud. In the cloud environment, physical envelopes were
situated on a mounted plate space PVFS2 servers. PVFS2
stripes records over the various information servers. Of
course, it uses round-robin and squares of 64 KB. Every
information server stores the pieces of a PVFS record in a
neighborhood Linux document, called datafile. An I/O
operation can cooperate with various information servers to
peruse and compose a lump of information. The application
is cross-stage and will synchronize with one or more client
characterized gadgets. Access to the information from the
gadgets or be controlled web program by approving client
qualifications like username and secret key. A few
highlights that will have the capacity to oversee from the
customer program and is made through a desktop
application are:
1.Synchronize envelope or change the season of
synchronization. It's truly intriguing to permit the client to
make when you need the information to be sent to the cloud,
so you can keep away from system blockage.
2.Create or erase client qualifications.
3.Change correspondence ports.
The administration server for the application will be through
a web interface. Utilizing this interface you can oversee
clients, change correspondence ports, courses information
stockpiling, reinforcement creation and stacking.
A. Programming dialect and coordinated advancement
environment (IDE)
The improvement of the application is made in Java
programming dialect, article arranged and intended to work
in systems, which likewise has two Oracle compilers, the
authority compiler of Oracle JDK and Open JDK group
authorized under the GPL, additionally has different
coordinated advancement situations (IDE), the most utilized
find NetBeans IDE as a part of which work and which was
created by Sun Microsystems now known as Oracle and
discharged under GPL. Other IDE is Eclipse, grown by the
Eclipse Foundation and discharged under EPL permit. Made
with no benefit by a consortium of organizations drove by
IBM. Both are IDE's multiplatform and good with other
programming dialects, ought to be noticed that such
applications permits source code refactoring, which speeds
the improvement of utilizations.

B. Operation customer application
The customer application will make a flush of all records
and indexes that are in the envelope characterized for
synchronization alongside their metadata. as it is indicated
in Fig. 1 So you may have the accompanying cases:
1)Customer: If the neighborhood document date is not
exactly or does not exist in the nearby organizer continues to
download a duplicate of the record to the neighborhood
envelope synchronization, if checked as erased continue to
erase the record from the envelope in client the cloud.
2) Cloud: If the record does not exist in the organizer
characterized in the cloud or have an adjustment date not
exactly the current, we continue to supplant the document
from the cloud by the most recent customer rendition.
The getting data from the records and envelopes will be
made utilizing the File class as a part of Java.
Correspondence and data sent between applications will be
made through Sockets utilizing the classes as a part of the
java.net bundle utilizing TCP. Records will be serialized and
sent as series of bytes, probably such byte strings encoded
with DES1 encryption strategies or TDES2. This to offer
more security and protection devices records are sent to the
cloud, once these byte streams coming to the server can
unscramble the store in client characterized space. The
customer application will likewise be able to make, erase
and supplant registries and/ or documents from encoded
byte streams got and/ or asked for from the server
application.
This application will have desktop interfaces that permit
simple administration of inclination. For example, organizer
synchronization, sync time setting, access qualifications
administration, characterizing association parameters.
Despite the fact that application is as of now being produced
for PC structure configuration is considering utilizing a
touch screen, this to get the assets that are as of now offer a
few models of portable PCs and PCs all in one that are
available today, likewise, later on to encourage the
movement of this application to a versatile working
framework like Android.

2134
IJRITCC | April 2015, Available @ http://www.ijritcc.org

_______________________________________________________________________________________

International Journal on Recent and Innovation Trends in Computing and Communication
Volume: 3 Issue: 4

ISSN: 2321-8169
2132 - 2136

_______________________________________________________________________________________________
V. CONCLUSION

Fig. 1 Sending data from the customer to the server
program
C. Operation server application
The server application will contain one string every client,
gave by the Thread class in Java, which through Sockets
application will contact the customer with whom offer
information and metadata, contingent upon the guidelines.
The application server will get information and metadata as
series of bytes scrambled the unscramble, compose, supplant
or erase as pertinent, furthermore send information and
metadata in encoded byte streams to the customer
application. This application will likewise include a web
application which can see and download the records in the
index of every client, and straightforward steps can make
the client record, and redesign individual data and secret
word. Metadata that will keep are: record name, size, way,
last date changed, erased imprint, this data is put away in a
MySQL table. View in Fig. 2.

The analysis demonstrate the predominance that exists on a
nearby document framework contrasted with a parallel
record framework where information is gotten to remotely.
Then again, PVFS2 enhances results to the usage of diverse
I/O servers, in this way diminishing the distinction in
execution in the middle of PVFS2 and EXT3. This will
legitimize the proposition to execute document framework
PVFS2 for a cloud domain for reinforcement and
information stockpiling. The point is to get better execution
with the consideration of PVFS2, on the grounds that it
diminishes the information access inertness, lessening
system activity and the information is conveyed crosswise
over distinctive I/O servers. This permit information be
circulated instead of be incorporated, forestalling complete
loss of information. At last this usage is being produced to
give clients the experience of executing and dealing with a
private cloud environment that encourages the
reinforcement and information stockpiling, utilizing
foundation effectively accessible or minimal effort. Staying
away from installment of charges or participations needed to
get this administration. This permits full control of the
individuals who access the data, so keeping up the
classifiedness of the information.
REFERENCES
[1]

[2]

[3]
[4]
[5]

[6]
[7]
[8]
[9]
[10]

[11]

Fig. 2. Depiction of customer and server applications

Mohammad Hamdaqa, Ladan Tahvildari, Cloud
Computing Uncovered: A Research Landscape, In: Ali
Hurson and Atif Memon, Editor(s), Advances in
Computers, Elsevier, 2012, Volume 86, Pages 41-85,
ISSN
0065-2458,
ISBN
9780123965356,
http://dx.doi.org/10.1016/B978-0- 12-396535-6.00002-8.
Rajkumar Buyya, Chee Shin Yeo, and Srikumar,
Venugopal. “Marketoriented cloud computing: Vision,
hype, and reality for delivering it services as computing
utilities”. CoRR, (abs/0808.3558), 2008.
P. J. Braam, "The Lustre Storage Architecture,"
November. 2002.
Windows Azure : http://www.windowsazure.com/es-es/
Hybrid
Cloud:
http://www.redhat.com/products/cloudcomputing/
cloudforms/
Dropbox: https://www.dropbox.com
SkyDrive:
http://windows.microsoft.com/eses/skydrive/download
GoogleDrive:
https://support.google.com/drive/answer/2424384
Panzura CloudFS file system (White paper)
http://panzura.com/products/global-file-system/
Oracle
Cloud
File
System
(White
paper):
http://www.oracle.com/us/products/database/cloudfilesystem/ overview/index.html
Michael Vrable_, Stefan Savage, and Geoffrey M.
Voelker, "BlueSky: a Cloud-Backed File System for the
2135

IJRITCC | April 2015, Available @ http://www.ijritcc.org

_______________________________________________________________________________________

International Journal on Recent and Innovation Trends in Computing and Communication
Volume: 3 Issue: 4

ISSN: 2321-8169
2132 - 2136

_______________________________________________________________________________________________

[12]

[13]

[14]
[15]

[16]

Enterprise" Proceeding of th 10th USENIX Conference
on File and Storage Technologies, February 14-17, 2012.
ISBN 978-1-931971-91-1
Philip Carns, Sam Lang, Robert Ross, Murali
Vilayannur, Julian Kunkel and Thomas Ludwig. (2009,
04-2009). Small-file access in parallel file systems.
R. Latham, N. Miller, R. Ross and P. Carns, "A NextGeneration Parallel File System for Linux Clusters,"
LinuxWorld, vol. 2, January, 2004.
J. M. Kunkel and T. Ludwig, "Performance evaluation of
the PVFS2 architecture," in 2007, pp. 509-516.
Camacho, H.E.; Nieto, E.; Anguita, M.; Díaz, A.F.;
Ortega, J., "Client cache for PVFS2," Parallel Distributed
and Grid Computing (PDGC), 2010 1st International
Conference on , vol., no., pp.38,43, 28-30 Oct. 2010. doi:
10.1109/PDGC.2010.5679607
Nieto, E.; Camacho, H.E.; Anguita, M.; Díaz, A.F.;
Ortega, J., "Fault tolerant PVFS2 based on data
replication," Parallel Distributed and Grid Computing
(PDGC), 2010 1st International Conference on , vol., no.,
pp.107,112,
28-30
Oct.
2010
doi:
10.1109/PDGC.2010.5679880

2136
IJRITCC | April 2015, Available @ http://www.ijritcc.org

_______________________________________________________________________________________