Technically what is the difference between s3n, s3a and s3?
Can someone explain the basic difference that distinguishes s3n, s3a and s3 in Hadoop? Technically how are they different?
S3 is a block-based overlay on top of Amazon S3, whereas S3N/S3A are not.
These are object-based. The difference between S3 and S3N/S3A is that S3 is a block-based overlay on top of Amazon S3, while S3N or the S3A is not because them being more object-based.
S3A is the successor to S3N. The difference between S3N and S3A is that S3N supports objects up to 5GB in size, while S3A supports objects within 5TB and it has much higher performance.
S3n is a native filesystem for reading and writing regular files on S3. The advantage of this filesystem is that you can access files on S3 that were written with other tools. Conversely, other tools can access files written using Hadoop.