Philip Howard from Bloor Research has posted some observations on his Blog entitled "CEP and Big Data 2". Here are some comments (actually nothing new - just summarizing things I have written about before).
Philip deals with three issues:
Philip deals with three issues:
- whether the name CEP is appropriate or should be changed?
- who should be credited as the pioneer of this area?
- whether CEP implies real-time processing?
- who are the CEP big data platforms?
Here are summary of my views on each of this topics.
The name "Complex Event Processing"
Exactly four years ago I posted on this Blog an explanation about - "why I prefer to use the name event processing without any prefix, infix or suffix". My particular dislike of the term "complex event processing" stems from the ambiguity in the name - some people (including David Luckham who coined this term) view it as processing of complex events, some interpret it as complex processing of events, and then debate of when something is complex enough, and what type of complexity is needed to qualify as CEP. Moreover some of the vendors use this term for products that are neither of the two options. I think that two words is enough for the name of a discipline, examples: information retrieval, machine learning, image processing and much more.... Thus, from my point of view the term "event processing" subsumes all other terms like complex event processing, business event processing, event stream processing and more.
Who gets the pioneering credit
Philip as a good UK patriot wonders why the Wikipedia value about Wikipedia and other sources gives credit to David Luckham and forget the Apama work that came from Cambridge UK. Looking at Wikipedia, it has one mention of David, as well as other references (like our EPIA book). It indeed does not mention Apama or any paper by John Bates, but being a Wikipedia, anybody can suggest additions.
David Luckham had major influence on this area, since he was the first one who published a full book and exposed the young area to the general public. An article in IEEE Computer, published in 2009, made some investigation of the history of that area and determined that in the 1990-ies there were four parallel projects that can be classified as starting points in this area: David Luckham's project in Stanford, John Bates' project in Cambridge (UK, not Boston), Mani Chandy in Cal Tech, and our Amit project in IBM Haifa Research Lab. I share Philip's view that John Bates should have full credit as one of the pioneers, and still view David Luckham as the "elder statesman" of the community.
Is CEP necessarily associated with real-time?
I have written several times about this topic, last time in response to Chris Carlson, to whom Philip also responds. There is some abuse of the term real-time in the industry, while its meaning is "within time constraints", many people interpret it as "with very low latency". This is not the same, anyway, event processing is a functionality with applications that require very low latency, applications which require to react within real-time constraints (which can be: 2 hours), some require both, and some require none.
Who are the CEP big data platforms?
I have taken upon myself the limitation not to state opinions on commercial products within this Blog - leaving it to analysts. Thus will make one comment. There is distinction between two types of software entities -
which is sometimes confused in the language used by people.
This is similar to the difference between an application server and a single component (programming in the small vs. programming in the large). Some of the available platforms for "event processing for big data" provide the first one -- it gives infrastructure, but not implementing any type of functionality, but enabling developers to create their own functionality, thus they don't do full-fledged event processing. Seems that many people classify both under the same classification (of course there are products that do both).