Upload
stratio
View
239
Download
5
Embed Size (px)
DESCRIPTION
Stratio is a Big Data platform based on Spark. It is 100% open source and enterprise - ready. In Stratio we are Pure Spark, since it is the only technology in the market able to combine stored data analyses with real time streaming data, all in the same query. We are unique in integrating Spark processing with the main NoSql databases: Cassandra, MongoDB, ElasticSearch, ...
Citation preview
•
•
•
•
•
•
•
•
•
•
•
SELECT * FROM tweets WHERE lucene=
'{
filter :
{
type : "range",
field : "time",
lower : "2014/04/25",
upper : "2014/04/1"
},
query :
{
type : "phrase",
field : "body",
values : ["big", "data"]
},
sort :
{
fields: [ {field:"retweets”, reverse:true} ]
}
}';
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
CASSANDRA
Kafka
STRATIO DEEP
STRATIO DEEP
•
•
•
•
•
•
•
readClobreadCSVreadLinereadMultiLinereadAvroreadJson
addCurrentTimeaddLocalHostgeoIPfindReplaceSplit
generateUUIDdecompressIfextractJsonPathsdetectMimeType
xqueryextractURIComponentsxsltGrok (regular expressions)
exec
spooling SNMP
Kite SoftwareDevelopment Kit