Tag Archive for cloud storage

Steve Jobs – an Object Storage Champion?

The recent launch of Mountain Lion, updates to iCloud and continued proliferation of iOS devices is giving everyone a view of what a file system-less world will look like (and it looks pretty darn good to us at Caringo). Of particular interest is what Steve Jobs had to say all the way back in 2005 and how, with all of the recent software and hardware launches, Apple is rapidly removing traditional file system conventions from their product line. Why? It can be summarized in two words – simplicity and portability. The subject was discussed a bit further in Fortune Mag’s blog – Steve Jobs: “Why is the file system the face of the OS?”. Job’s exact words were:

in every user interface study we’ve ever done […], [we found] it’s pretty easy to learn how to use these things ’til you hit the file system and then the learning curve goes vertical. So you ask yourself, why is the file system the face of the OS? Wouldn’t it be better if there was a better way to find stuff?

What Job’s was really highlighting is the way files are stored and addressed. In a traditional file system you store files on a device, then a volume (C:, D:…), then in a directory or bunch directories, then with a name and file type.

File systems = complexity

What goes on behind the scenes (in a corporate filer) is even more complex with each file being split into thousands of blocks each addressed with an inode and then stored in different places on a disk. It is all this complexity that ultimately leads to what Jobs described as “learning curve going vertically”. Users need to remember where the files are, the name of the file, to back up the file and in a corporate world, it gets even more complicated. But the real issue is that file systems have limitations and in today’s “cloudy” world they lock files into a single physical location making data portability extremely difficult.

File system don’t = portability

The way that file systems address content also locks it into a particular location making it very difficult to manage content portability across devices and platforms. Think about the opposite action of retrieving a file through a file system. You need to know the entire path (\\serve\volume\directory..\filename.ext). Again what goes on behind the scenes is even more complex. Every inode is looked up, the blocks that represent a file are all found on the disc and stitched back together and the file is delivered. Now think about this for thousands, millions or billions of files on different devices all trying to be synchronized. It is just way too complex to do it in a practical fashion.

So… what is Apple doing?

What Apple is doing is leveraging a key/value method of storage and addressing which is the same in most object storage solutions. What this does is assign a unique value on content which is stored with meta data that can be presented to give more information on what is in the content and how it should be managed. They can then store everything in one big pool of storage and access that piece of content with only that one key/value. So the application doesn’t need to know the exact location at all times and doesn’t need to stich everything together. This also dramatically improves storage management and access. Instead of trying to manage a long directory path and thousands of inode addresses, applications only need to manage a single value and metadata which can all be synched to the same repository (like iCloud).

User-friendly storage (for end users and admins)

We leverage these principals in the Object Storage Platform and like Apple, think storage should be simplified to the extent that IT administrators don’t think about it and are free to focus on what is core to their business. So we agree with Job’s comments and want to extend it a bit to ‘why is the file system the face of the OS, and why is it the face of storage’. There is an easier and better way to store content that you are most likely already benefiting from today and it is easier than you think to setup and use throughout your organization.

DiVault, Archiving in the Cloud powered by Caringo

DiVault, an initiative of R5 Projects B.V. and EMID Consult B.V., has launched a new digital archiving in the cloud service powered by Caringo software.  The new service uses the Caringo Object Storage Platform with commodity hardware to guarantee the continuity, safety and security of data for organizations looking for a better way to structure, organize, store, search and retrieve their growing volume of digital information.

Targeting those in the government, industry, education, and financial markets, the new cloud-based service allows customers to store large quantities of data in data centers throughout the Netherlands to cost-effectively archive and retrieve business-critical information.  Caringo software delivers a secure, multi-tenant system utilizing a single namespace that can scale to billions of objects and petabytes per location with superb performance and guaranteed data protection.

Three of the key benefits that Caringo provides are 1) the ability to keep the solution responsive regardless of the number of files/objects or file size (4KB to 4TB+), 2) vendor neutral, plug and play scale out with no provisioning and automated storage balancing, and 3) a symmetric architecture with automated data integrity checking, self healing and rapid recovery that actually gets faster as the service grows.

So what does this mean for cloud storage service management staff? Well, it means that one IT administrator can manage over 10 PB of storage AND they can easily keep up with hardware power consumption and capacity improvements. We are proud to add DiVault to our expanding list of customers and look forward to their continued success.

Caringo Used as Cloud Infrastructure for Medical Record Cloud

Our friends at the the Register posted a great article today giving an overview on how Caringo, VMWare and Intel are playing a key role in what will become the central, secure storage repository of digital records for 330 million people as part of a medical imaging cloud for patients across the United States.  The solution is a project developed through a partnership between John Hopkins University and Harris Corporation called Peake Healthcare Innovations.

The medical record cloud called PeakeSecure, uses our object storage platform as cloud storage infrastructure to automatically protect hundreds of millions of files each with 3 replicas. When dealing with an archive of this size drive failure is assured but by using replicas our software can reliably protect billions of files and hundreds of petabytes of records. This simply isn’t possible with RAID 5 or 6 because of performance issues associated with striping and rebuild times that increase as capacity increases.

PeakeSecure also uses our adaptive power consumption technology (Darkive), which allows CPU and the disks within the array to spin up or down as access patterns to data changes, resulting in a huge savings on operational and energy costs. Darkive is especially useful for cost savings in the medical industry where records need to be stored for a minimum of 7 years and are rarely accessed after a patient is treated. We have some customers that store children’s medical records that plan to store them indefinitely – imagine the cost if those discs kept spinning forever.

The solution has already been successfully tested in a private cloud utilized by two Johns Hopkins hospitals with a full version rolling out to its university hospital system in March.  The public cloud version is scheduled to be completed later this year.

5 reasons Object Storage will take over the cloud in 2012

1. Object Storage will bring Cloud economics to any organization in 2012

A key to the affordability delivered with cloud storage is to begin with commodity components in a redundant, multi-tenant architecture with highly efficient disk utilization. These benefits are driven by object storage infrastructure which most assume is due to economies of scale and proprietary development, however advancements in object storage including significant progress in standardization of interfaces and a maturing ISV ecosystem, are making object storage one of the best options for organizations looking to cost effectively store their unstructured data.

2. The Relentless Growth of Data Will Expose the Limitations of SAN and NAS
The complexities and costs of traditional NAS and SAN storage arrays will continue to become increasingly prohibitive as a viable long-term storage option for unstructured data. Information created by organizations will need to be stored and accessible for indefinite periods of time driven by the ability to re-use and by regulatory compliance purposes. Object storage will be one of the few ways that organizations can easily and cost effectively store massive libraries of information that are instantly accessible.

3. Object Storage will enable the Adoption of Hybrid and Private Cloud 
Organizations will increase their adoption of hybrid and private cloud storage solutions as IT departments look to solve management and cost issues associated with data growth. The move to hybrid and private cloud solutions will be based primarily on the ability to guarantee the security and integrity of their information while still benefiting from cloud economics.

4. Object Storage Will Help with File Count in Addition to Storage Capacity
In addition to an increase in capacity organizations are also seeing an increase in number of files driven by Big Data applications, research equipment, and continued optimization of web delivered information. These files range from a few KBs to tens of TBs and are exposing the limitations of file system. As the number of files increase file systems become less responsive requiring IT to purchase new storage systems to increase performance and responsiveness instead of capacity. Organizations will turn to object storage to provide a flat and highly efficient address space with no limit in file count or capacity.

5. Data Growth Will Prohibit Backups
As data sets grow backup windows get longer and longer until they ultimately become unmanageable and IT must decide what data to backup or even worse, not backup at all. Object storage will be turned to as organizations realize that they can use file replication and integrated self-healing, self-optimization and metadata driven data lifecycle management to eliminate backups altogether.