[B! flink] [2ページ] kimutanskのブックマーク

Flink Forward SF 2017: Konstantinos Kloudas - Extending Flink’s Streaming APIs

kimutansk 2017/04/19

ProcessFunction、イベント受信時にTimer仕掛けて発動時に状態比較してクエリングして出力などをFlink側の機構に乗っかってできますか。

flink
stream

リンク

Flink forward-2017-netflix keystones-paas

kimutansk 2017/04/19

Flinkをコンテナ化してSPaaSとして使うパターン。CheckPointをS3に入れれば、死んでも復活させればそれで済む話ではありますが、ここまで大規模で自動化してやるのはすごい。

stream
flink

リンク

Continuous Queries on Dynamic Tables: Analyzing Data Streams with Streaming SQL - data Artisans

kimutansk 2017/04/11

端的にはDataflowの概念＋Kafka Streamsのクエリ可能状態＋Calsiteのグループ化ウィンドウを盛り込んだFlinkによる大規模ストリームアプリケーションが次バージョンで可能になると。楽しみですね。

stream
flink

リンク

Deep Dive of Flink & Spark on Amazon EMR - February Online Tech Talks

Organizations are demanding increasingly faster tools to process and analyze data in real time. Apache Spark and Apache Flink have emerged as popular, open source frameworks to address these requirements. In this tech talk, we provide an overview of these techno logies and the differences between them. We show how you can deploy Apache Spark and Flink on AWS to address common big data use cases suc

kimutansk 2017/03/07

Amazon EMRの方から元々Spark Streamingに時間使ってたりマイクロバッチでない限りもうFlinkでいいよという比較結果が。まぁ実際そうなんでしょうけど。

リンク

Fabian Hueske - Stream Analytics with SQL on Apache Flink

SQL is undoubtedly the most widely used language for data analytics for many good reasons. It is declarative, many database systems and query processors feature advanced query optimizers and highly efficient execution engines, and last but not least it is the standard that everybody knows and uses. With stream processing techno logy becoming mainstream a question arises: “Why isn’t SQL widely suppo

kimutansk 2017/02/21

StreamとTableの対応について、"Stream / Table Duality"、二重性と表現しますか。なるほど。

stream
flink

リンク

Stream Analytics with SQL on Apache Flink

Apache Flink's DataStream API is very expressive and gives users precise control over time and state. However, many applications do not require this level of expressiveness and can be implemented more concisely and easily with a domain-specific API. SQL is undoubtedly the most widely used language for data processing but usually applied in the domain of batch processing. Apache Flink features two

kimutansk 2017/02/14

Flinkでも明確にStream<>Dynamic Table(Update/AddRetract)という形で概念出て、無限にデータを溜められないとか出るあたり、このあたりはもう一般的と言っていいですかね

stream
flink

リンク

Apache Flink® Community Announces 1.2.0 Release - data Artisans

kimutansk 2017/02/07

内部状態一貫性保障レベルを下げずに動的スケール、Window関数の内部状態が使える機能追加、非同期IOのハンドリング強化、外部チェックポイント機構追加、内部状態のクエリ可能化、Mesos統合と。

stream
flink

リンク

Flink Streaming Windows – A Comprehensive Guide - DataFlair

kimutansk 2017/02/01

Windowingという観点から見たFlinkに対するまとめと。

stream
flink

リンク

Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink's Approach to It

kimutansk 2017/01/18

残るはSQLによる適用範囲の拡大と、動的な並列度調整の強化、後は非常に大きな状態値への対処ですか。

stream
flink

リンク

Aljoscha Krettek - Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Analytics

kimutansk 2016/12/20

ストリーム＞分散＞ステートフル＞EventTimeという流れ。時刻がずれる要因を挙げてEventTimeとProcessingTimeの違いを説明すると、なるほど。

stream
flink

リンク

Keynote: Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink's Approach to It

kimutansk 2016/12/20

１時間毎区切りのデータもStreamと言ってしまうのは実際そう言えますね。バッチは有限の区切られたストリームという。クエリ可能がやはりこちらでも来ますか。あとは巨大データへの対応と・・・

stream
flink

リンク

Apache FlinkのWindow周りAPIを紹介 - Qiita

Distributed computing (Apache Hadoop, Spark, ...) Advent Calendar 2016 15日目の記事です. Apache FlinkのStreaming APIにおけるWindow周りの機能を紹介してみます. https://ci.apache.org/projects/flink/flink-docs-release-1.1/apis/streaming/windows.html に書いてあるような内容です. ベースにしているFlinkのバージョンは1.1です. 例などはScalaで書きますが、Javaでも大体同じような感じです. はじめにストリーム処理では、データは終わること無く永遠に流れてきます. 従って、そこに対して計算を行うためには、何かしらの手段で計算対象のデータを区切ることが必要になってきます. この、永続的に流れるデ

kimutansk 2016/12/15

固定長WindowはGoogleですとFixedですが、FlinkやKinesisですとTumblingなのでそちらになる流れですかね。Windowの概念説明はわかりやすいです。

stream
flink

リンク

Data Stream Analytics - Why they are important

Streaming is cool and it can help us do quick analytics and make profit but what about tsunamis? This is a motivation talk presented at the SeRC Big Data Workshop in Sweden during spring 2016. It motivates the streaming paradigm and provides examples on Apache Flink.

kimutansk 2016/11/30

FlinkにSQLインタフェース、StreamML、StreamGraph処理、オートスケール、インクリメンタルスナップショットが来る予定と。インクリメンタル来ますか。

stream
flink

リンク

Stream Processing Myths Debunked - data Artisans

By @kostas_tzoumas and @wints Needless to say, we here at data Artisans spend a lot of time thinking about stream processing. Even cooler: we spend a lot of time helping others think about stream processing and how to apply streaming to data probl ems in their organizations. A good first step in this process is understanding misconceptions about the modern stream processing space (and as a rapidly-

kimutansk 2016/11/30

ストリーム処理の神話：6つの誤解と。最近このあたりまとまってきて、ストリーム処理が解析の域を超えて広がっているのはうれしいことですね。

stream
flink

リンク

Throughput, Latency, and Yahoo! Performance Benchmarks. Is there a winner? - DataTorrent

kimutansk 2016/11/15

13ComputeNode(XeonE5*2/24Core+252GBRAM) 3TB SATA(III) HDD*6&4BrokerNode(Kafka0.8.2)&10GbNICでRedis上のデータとJoin>10秒Windowing>Redis出力。それが270万レコード(288Bytes)/秒のスループットで概ね4秒遅延に収まると

リンク

Functional Comparison and Performance Evaluation of Streaming Frameworks

kimutansk 2016/10/12

各ストリーム処理基盤同士の比較資料。Gearpumpは比較的基盤部が薄いので性能は全般的に高いが、アプリを実現する際の機能としてFlink等に劣り、開発にも手間が大きいという傾向？

リンク

Flink How To: A Demo of Apache Flink with Docker » Big Data Europe

Flink How To: A Demo of Apache Flink with Docker September 29, 2016. Written by Gezim Sejdiu. Posted in Blog, Big Data, BDE-Techno logy This guide explains the steps of how to run a Flink application on the BDE platform. Apache Flink is an open-source platform for distributed stream and batch processing. In this post, we are going to see how to launch a Flink demo app in minutes, thanks to the Apac