On August 1st, Kyligence presented an Apache Kylin tech talk at a Meetup hosted by Big Data Bellevue.
For those unfamiliar, Big Data Bellevue is an active Meetup group in Seattle’s Eastside area, run by Siddharth Agrawal and Chaitanya Dabke. Its members are technology professionals from Seattle, Bellevue, Redmond, Kirkland, and other neighboring cities. Over the past five years, it has produced many excellent sessions focused on Big Data technologies. We were very excited to have a chance to present at this Meetup.
Architects and engineers from eBay, T-Mobile, Amazon, Microsoft, and other major organizations attended the talk. Daniel Gu, VP of the Americas at Kyligence, and Shaofeng Shi, Kyligence’s Chief Architect, delivered the presentation. The discussion started with Daniel, an eBay veteran who oversaw the company’s global analytics platform. He recounted the story of how the business’ need to analyze millions of transactions interactively gave birth to a new analytics project. That project would eventually go on to become Apache Kylin.
Daniel shared his view of the evolution of data processing over the past two decades and busted the myth that OLAP is dead. A great overview of this can be found in his blog article on the subject: OLAP Analytics is Dead. Really?.
Shaofeng Shi is one of the original creators of the Apache Kylin project, an Apache Kylin PMC member, and one of Kyligence’s founding engineers. He has been an evangelist for Apache Kylin over the past few years, speaking at open source conferences around the world. At Kyligence, Shaofeng helps the company’s largest customers set up analytics architectures that process trillions of rows of data while serving 100,000+ active users.
Shaofeng’s portion of the talk introduced the latest addition to Apache Kylin 3.0, Real-Time Analytics, which is scheduled for GA later this year.
Traditionally, the Kylin OLAP engine was designed to process historical data stored in data lakes, in formats such as Hive tables. Near real-time processing, added in Kylin 1.5, reads data from streaming sources like Kafka and updates the cube in a mini-batch fashion. Until now, the delay of Kylin’s near real-time processing was around one minute, which is too slow for certain use cases such as fraud alerts on financial transactions.
In Kylin 3.0, aggregated data (cubes) is stored in real-time servers and/or historical servers (see diagram below). Each query request is divided into two parts according to the Timestamp Partition Column. The part covering the latest time period is sent to the real-time node, and the part covering historical data is still sent to the HBase region server.
The query server then merges the two results and returns them to the client. At the same time, the real-time node continuously uploads its local data to HDFS. When a certain condition is met, the segment is built by MapReduce. This converts the real-time part into a historical part and reduces the pressure on the real-time computing node.
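The split-and-merge step described above can be illustrated with a small Python sketch. It is not Kylin's implementation or API: the `Row` type, the `cutoff` parameter, and the functions `split_range`, `aggregate`, and `query` are hypothetical names chosen for illustration, and two in-memory lists stand in for the historical store and the real-time node. It assumes a single cutoff timestamp separates historical data from real-time data.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    ts: int        # event time; plays the role of the timestamp partition column
    key: str       # a dimension value, e.g. a merchant id
    amount: float  # the measure to aggregate


def aggregate(rows, start, end):
    """Sum `amount` per `key` for rows with start <= ts < end."""
    totals = Counter()
    for row in rows:
        if start <= row.ts < end:
            totals[row.key] += row.amount
    return totals


def split_range(start, end, cutoff):
    """Split [start, end) at `cutoff` into (historical, realtime) sub-ranges.

    A sub-range is None when the query does not touch that side.
    """
    hist = (start, min(end, cutoff)) if start < cutoff else None
    real = (max(start, cutoff), end) if end > cutoff else None
    return hist, real


def query(historical_rows, realtime_rows, start, end, cutoff):
    """Route each sub-range to its store, then merge the partial aggregates."""
    hist, real = split_range(start, end, cutoff)
    merged = Counter()
    if hist:
        merged.update(aggregate(historical_rows, *hist))
    if real:
        merged.update(aggregate(realtime_rows, *real))
    return dict(merged)


# Hypothetical data: rows before ts=50 are "historical", the rest "real-time".
historical_rows = [Row(10, "a", 5.0), Row(20, "b", 2.0)]
realtime_rows = [Row(60, "a", 1.5), Row(70, "c", 4.0)]
result = query(historical_rows, realtime_rows, start=0, end=100, cutoff=50)
```

Merging is straightforward here because sums are additive across disjoint time ranges. Measures that are not additive, such as exact distinct counts, would need a different merge step.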
With real-time processing capabilities, Kylin can now serve multidimensional analysis for both historical and real-time data. This opens doors for many use cases in financial services, IoT, healthcare, retail, ad tech, and other major industries.
Customers can now process all of their analytical queries with one technology – Kylin. This greatly simplifies the technology architecture and improves productivity and accuracy of analysis. For more details about Kylin real-time analysis, please visit this page.
The Big Data Bellevue Meetup was a wonderful opportunity and we appreciate having had the chance to share Kylin’s story and connect with so many great members of the Washington Big Data community. We look forward to attending again in the future. If you’d like for us to speak at your next Meetup or community event, feel free to contact us.
Also be sure to follow us on Twitter and LinkedIn for the latest updates regarding Apache Kylin, Kyligence Enterprise, Kyligence Cloud, and Kyligence Insight.