Technically what is the difference between s3n, s3a and s3?

Asked by HasegawaSaito in Data Science on Jul 12, 2021

Can someone explain the basic difference that distinguishes s3n, s3a and s3 in Hadoop? Technically how are they different?

Answered by David EDWARDS

The difference between S3 and S3N/S3A is that S3 is a block-based overlay on top of Amazon S3, whereas S3N and S3A are object-based.

S3A is the successor to S3N. S3N supports objects up to 5GB in size, while S3A supports objects up to 5TB and has much higher performance.
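As a rough illustration of the size limits above, here is a minimal Python sketch. The helper names `fits` and `to_s3a` are made up for this example and are not part of Hadoop; the limits are taken straight from the answer and use decimal units for simplicity. It only checks a size against each connector's ceiling and rewrites an `s3n://` URI to its `s3a://` equivalent.

```python
# Hypothetical helpers for illustration only; not a Hadoop API.
GB = 10**9
TB = 10**12

# Per-object size ceilings as quoted in the answer above.
MAX_OBJECT_BYTES = {"s3n": 5 * GB, "s3a": 5 * TB}


def fits(scheme: str, size_bytes: int) -> bool:
    """Return True if an object of size_bytes is within the scheme's limit."""
    return size_bytes <= MAX_OBJECT_BYTES[scheme]


def to_s3a(uri: str) -> str:
    """Rewrite an s3n:// URI to s3a://; any other URI is returned unchanged."""
    prefix = "s3n://"
    if uri.startswith(prefix):
        return "s3a://" + uri[len(prefix):]
    return uri
```

For example, `fits("s3n", 6 * GB)` is false while `fits("s3a", 6 * GB)` is true, which is one reason a job writing large files would point at an `s3a://` path instead.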

S3N is a native filesystem for reading and writing regular files on S3. Its advantage is that you can access files on S3 that were written with other tools and, conversely, other tools can access files written using Hadoop.
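To make the block-based versus object-based distinction concrete, here is a toy Python model that uses a plain dict as a stand-in for a bucket. It is only a sketch: the function names, the block key naming, and the tiny block size are all assumptions for illustration, and it does not reproduce Hadoop's actual storage format.

```python
BLOCK_SIZE = 4  # deliberately tiny; an assumption made for the demo


def put_object_based(store: dict, key: str, data: bytes) -> None:
    """Object-based idea (S3N/S3A): one file is one object under its own key."""
    store[key] = data


def put_block_based(store: dict, key: str, data: bytes) -> None:
    """Block-based idea (S3): the file is cut into blocks plus an index entry."""
    block_keys = []
    for offset in range(0, len(data), BLOCK_SIZE):
        block_key = f"block_{key}_{offset // BLOCK_SIZE}"  # hypothetical naming
        store[block_key] = data[offset:offset + BLOCK_SIZE]
        block_keys.append(block_key)
    # The file's own key holds only the list of blocks, not the content.
    store[key] = block_keys


def get_block_based(store: dict, key: str) -> bytes:
    """Reassemble a file by following the block index."""
    return b"".join(store[block_key] for block_key in store[key])
```

In this model, a tool that simply reads `store["a.txt"]` gets the file bytes in the object-based layout, but only a list of block names in the block-based one. That is the interoperability point made above: files written through the object-based filesystems stay readable by other tools.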




Answer (1)

Thanks, David, for the detailed reply. This is exactly what I was looking for.

