Hello,
Could you please give few more specifics? For example,
1. What is the domain for this? e.g. automotive, BFSI, Retails etc.
2. What is the size of the data and volume? Looking for real-time stream processing or off line processing?
3. What is the Cluster size you plan to run this? Just demo site or enterprise site with hadoop clusters already in place?
More details, such as above, helps us to clearly understand the requirement and tune the solution to your exact needs.
Myself, I am Gopalakrishna Palem, a well technology management & strategy consultant specialized in big-data, and Predictive-Analytics. Microsoft and Oracle are few of my clients I helped build system, as can be seen from my linked-in page and my other blogs. (Please search for my name Gopalakrishna Palem on google to learn more about my work on big-data).
Let us know your views. Thank you,
GK