[Avg. reading time: 2 minutes]

Protocol

Protocols are standardized rules that govern how data is transmitted, formatted, and processed across systems.

In Big Data, protocols are essential for:

  • Data ingestion (getting data in)
  • Inter-node communication in clusters
  • Remote access to APIs/services
  • Serialization of structured data
  • Security and authorization
ProtocolLayerUse Case Example
HTTP/HTTPSApplicationREST API for ingesting external data
KafkaMessagingStream processing with Spark or Flink
gRPCRPCMicroservices in ML model serving
MQTTMessagingIoT data push to cloud
Avro/ProtoSerializationBinary data for logs and schema
OAuth/KerberosSecuritySecure access to data lakes
Last change: 2025-10-15