Dev Breakthroughs
68.9K views | +0 today
Follow
Dev Breakthroughs
Monitoring innovations in database, PHP, JS, RIA, HTML5, mobile and agile dev strategies & tools
Curated by Nicolas Weil
Your new post is loading...
Your new post is loading...
Scooped by Nicolas Weil
Scoop.it!

The Netflix Tech Blog: STAASH - STorage As A Service over Http - A multi-storage abstraction layer

The Netflix Tech Blog: STAASH - STorage As A Service over Http - A multi-storage abstraction layer | Dev Breakthroughs | Scoop.it
Netflix’s Astyanax project, and the recipes contained therein, have been a tremendous tool in helping Java developers adopt Apache Cassandra (C*), both within Netflix and outside of Netflix.  A common request we have gotten from non-Java, or non-JVM based (eg: Python, Ruby, BASH, JavaScript), developers is that they would like to take advantage of the recipes provided in Astyanax such as All-Rows-Query or Chunked-Object-Store.


STAASH’s short-term goal is to recreate the most popular Astyanax recipes as a service, providing a REST based API to Cassandra. This would allow ANY language to consume those recipes.
No comment yet.
Scooped by Nicolas Weil
Scoop.it!

The NoSQL Landscape in Diagrams

The NoSQL Landscape in Diagrams | Dev Breakthroughs | Scoop.it

Here’s the NoSQL landscape in 3 slides.

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

SQLifying NoSQL – Are ORM tools relevant to NoSQL?

SQLifying NoSQL – Are ORM tools relevant to NoSQL? | Dev Breakthroughs | Scoop.it

 

If you reached this page, it’s fair to assume that you must have worked on at least one relational database in your lifetime. They have been in use for a quarter of a century and are found in almost all business applications.

 

But, NoSQL databases are gaining traction these days. they are often called “Not only SQL” databases. It’s an umbrella term for a loosely defined class of non-relational data-stores.

They exhibit following main characteristics:

- They don’t use SQL as their query language.
- They may not give full ACID guarantees.
- They have distributed, fault-tolerant architecture.


In this article, I am going to explore whether ORM tools (whatever they are) make sense in NoSQL world…and whether they will be able to solve problems that are NoSQL specific. Next we’ll delve into approaches and challenges in making such a tool. This article assumes you are already familiar with and have worked on one (and possibly more) NoSQL database

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

Node.js and MongoDB – A match made in heaven

On Tuesday I did a presentation for the Pittsburgh JavaScript Developers Meetup group about using Node.js with NoSQL – or document based – databases. I chose MongoDB because it’s got great traction and there’s a Node module for it named Mongoose.

 

Unfortunately, I was on vacation the week before the presentation, so I didn’t get to cover absolutely everything that I wanted to, but I was able to show some great highlights of what can be done with Mongoose. When I first started to look in to it, I was immediately attracted to its syntax and API being very JavaScript friendly.

 

First things first, though. You can install Mongoose easily by using npm – npm install mongoose.

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase comparison

While SQL databases are insanely useful tools, their monopoly of ~15 years is coming to an end. And it was just time: I can't even count the things that were forced into relational databases, but never really fitted them.

 

But the differences between "NoSQL" databases are much bigger than it ever was between one SQL database and another. This means that it is a bigger responsibility on software architects to choose the appropriate one for a project right at the beginning.

In this light, here is a comparison of Cassandra, Mongodb, CouchDB, Redis, Riak, Membase, Neo4j and HBase:

 

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

Amazon's streaming data service, Kinesis, is now available

Amazon's streaming data service, Kinesis, is now available | Dev Breakthroughs | Scoop.it

Kinesis, the streaming data service that Amazon Web Services announced in November, is now publicly available. The service is comparable in theory to popular open source technologies such as Apache Storm, only Kinesis brings with the fully managed experience that’s becoming par for the course within AWS.

 

Like Storm, Kinesis can process data in real time before shipping it into another data store — most likely Elastic MapReduce, Redshift or DynamoDB within the AWS platform. Unlike Storm, however, Kinesis can maintain data for up to 24 hours and isautomatically scalable up to hundreds of terabytes per hour via a software development kit, or SDK. Kinesis does include a connector for porting data to Storm, which AWS General Manager for Data Science Matt Wood said is a possibility in cases where existing Storm users want to keep using it for processing data while automating the collection with Kinesis.

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

Distributed Algorithms in NoSQL Databases

Distributed Algorithms in NoSQL Databases | Dev Breakthroughs | Scoop.it

Scalability is one of the main drivers of the NoSQL movement. As such, it encompasses distributed system coordination, failover, resource management and many other capabilities. It sounds like a big umbrella, and it is. Although it can hardly be said that NoSQL movement brought fundamentally new techniques into distributed data processing, it triggered an avalanche of practical studies and real-life trials of different combinations of protocols and algorithms. These developments gradually highlight a system of relevant database building blocks with proven practical efficiency. In this article I’m trying to provide more or less systematic description of techniques related to distributed operations in NoSQL databases.


In the rest of this article we study a number of distributed activities like replication of failure detection that could happen in a database. These activities, highlighted in bold below, are grouped into three major sections:

- Data Consistency

- Data Placement

- System Coordination

김필수's curator insight, January 15, 2015 11:03 PM
logic
Mercy Ospina's curator insight, March 7, 2016 4:38 PM

Excelente compendio y clasificación de los Algoritmos distribuidos más usados por NoSQL

Scooped by Nicolas Weil
Scoop.it!

MongoDB vs. RDBMS Schema Design

MongoDB vs. RDBMS Schema Design | Dev Breakthroughs | Scoop.it

This article explains how MongoDB schema differs from an equivalent RDBMS schema, and how common relationships between entities are replicated in MongoDB.

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

LexisNexis open sources code for Hadoop alternative

LexisNexis open sources code for Hadoop alternative | Dev Breakthroughs | Scoop.it

HPCC Systems, the division of LexisNexis Risk Solutions dedicated to big data, has released the open source code of its data-processing-and-delivery software it’s positioning as a better version of Hadoop. The High Performance Computing Cluster code is available on Github, and it marks the commencement of HPCC Systems’ quest to build a community of developers underneath Hadoop’s expansive shadow.

 

The HPCC architecture includes the Thor Data Refinery Cluster and the Roxy Rapid Data Delivery Cluster. As I explained when covering the HPCC Systems launch in June, “Thor — so named for its hammer-like approach to solving the problem — crunches, analyzes and indexes huge amounts of data a la Hadoop. Roxie, on the other hand, is more like a traditional relational database or database warehouse that even can serve transactions to a web front end.” Both tools leverage the company’s Enterprise Control Language, which Escalante describes as easier, faster and more efficient than Hadoop MapReduce.

 

Aside from the open source Community version, HPCC Systems also offers a paid Enterprise version of the HPCC product. The core code is the same, Escalante explained, with the major differences being additional enterprise-grade capabilities such as management tools and support and services.

No comment yet.
Scooped by Nicolas Weil
Scoop.it!

Data powers a new breed of startup

Data powers a new breed of startup | Dev Breakthroughs | Scoop.it

A new breed of startup is emerging, built to take advantage of the rising tides of data across a variety of verticals and the maturing ecosystem of tools for its large-scale analysis.

 

These are data startups, and they are the sumo wrestlers on the startup stage. The weight of data is a source of their competitive advantage. But like their sumo mentors, size alone is not enough. The most successful of data startups must be fast (with data), big (with analytics), and focused (with services).

No comment yet.