Microsoft adds Hadoop for Large Data Processing
If you work with open source data tools, you are probably familiar with Hadoop. It is the framework that distributes the storage and processing of huge datasets across clusters of machines, released under the Apache License 2.0. Microsoft is now incorporating it into its server and cloud technologies.
Users of Hadoop
Processing large amounts of data is the hallmark of Hadoop. One example is Twitter, which must process 12 terabytes of data every day and uses Hadoop to make that possible. Twitter’s problem is that it needs to store more data per day than can reliably be written to a single hard drive, so it spreads the data across clusters. Twitter uses Cloudera’s Hadoop distribution to power those clusters.
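To put the 12 terabytes a day in perspective, here is a rough back-of-the-envelope sketch in Python. It converts the daily volume into a sustained write rate and estimates how many drives would be needed just to keep up. The 100 MB/s drive throughput is a hypothetical figure chosen for illustration, not a number from Twitter, and the function names are made up for this example.

```python
import math

SECONDS_PER_DAY = 24 * 60 * 60


def sustained_write_rate_mb_per_s(terabytes_per_day):
    """Average write rate (decimal MB/s) needed to absorb a daily volume."""
    bytes_per_day = terabytes_per_day * 10**12
    return bytes_per_day / SECONDS_PER_DAY / 10**6


def min_drives_needed(terabytes_per_day, drive_mb_per_s):
    """Smallest number of drives whose combined throughput covers the rate.

    drive_mb_per_s is an assumed sustained write speed per drive.
    """
    rate = sustained_write_rate_mb_per_s(terabytes_per_day)
    return math.ceil(rate / drive_mb_per_s)


if __name__ == "__main__":
    rate = sustained_write_rate_mb_per_s(12)
    print(f"12 TB/day is roughly {rate:.1f} MB/s, around the clock")
    # 100 MB/s per drive is a hypothetical value, not a measured one.
    print("Drives needed at 100 MB/s each:", min_drives_needed(12, 100))
```

This only covers raw write speed. A real cluster also keeps multiple copies of each block and needs headroom for reads and failures, so the actual machine count would be considerably higher.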
Klout is another enterprise that uses Hadoop, but it is turning to Microsoft for analysis. Dave Mariani of Klout said, “I would rather go open source, that’s my first choice always. But when it comes to open source, scalable analysis tools, they just don’t exist yet.” So instead, Klout’s stack is switching to a service that can meet its analytical needs. The data may be stored in Hadoop, but it is hard to use effectively there, so Klout is turning to Microsoft to provide the data analysis it needs.
Klout uses Hadoop for its data operations at the Palms Hotel in Las Vegas, where it processes one billion pieces of user data every day.
Cloud Data Source
Hadoop is making its biggest impact where it naturally fits, which is in providing services for the cloud, and Microsoft is taking advantage of that. As Azure cloud services grow, so does the need to manage their data efficiently. Microsoft’s plan is to make Hadoop data accessible through Windows Azure cloud-based deployments, and to enable it to be analyzed with the company’s Business Intelligence tools.
Since last October, Microsoft has been working with Hortonworks on building distributions of Hadoop for both Windows Server and Windows Azure. It is not clear how soon these will be generally available, and there is some talk that Hadoop may be added as a server role as well.
Microsoft adds Hadoop for Large Data Processing was posted on Everything Microsoft – Latest Microsoft News, Guides, Reviews & Themes.