A Comparative Study on Performance and Resource Utilization of Real-time Distributed Messaging Systems for Big Data

64

Views

0

Downloads

Intorruk, Somprasong and Numnonda, Thanisa (2019) A Comparative Study on Performance and Resource Utilization of Real-time Distributed Messaging Systems for Big Data In: 2019 20th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2019-07-08, Toyama, Japan.

Abstract

In the past few years, data continuously increase and in various forms called Big Data. Besides, data analytics play an important role more and more in most organizations. From these reasons, many efficiently distributed messaging systems have been introducing to handle Big Data in real-time. However, choosing the appropriate and efficient methods and tools to transfer Big Data is still challenging. Therefore, this paper purposes of comparing the architecture, performance, and resource utilization between Apache Kafka which is one of the favorite tools for Big Data and Apache Pulsar which is similar to Kafka and become one of the latest tools for big data. After we implemented both systems in the same environment, the results show that Pulsar outperforms in throughput, latency, and average resource utilization especially when the size of messages is small (such as 1 KB and 1MB).

Item Type:

Conference or Workshop Item (Paper)

Identification Number (DOI):

Deposited by:

ระบบ อัตโนมัติ

Date Deposited:

2021-09-09 23:53:44

Last Modified:

2022-06-22 13:14:04

Impact and Interest:

Statistics