Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-21077

Cannot access public files over S3 protocol

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Not A Problem
    • 2.1.0
    • None
    • EC2
    • None

    Description

      I am trying to access a dataset with public (anonymous) credentials via the S3 (or S3a, s3n) protocol.

      It fails with the error that no provider in chain can supply the credentials.
      I asked our sysadmin to add some dummy credentials, and if I set them up (via link or config) then I have access.

      I tried setting the config :

      <property>
        <name>fs.s3a.credentials.provider</name>
        <value>org.apache.hadoop.fs.s3a.AnonymousAWSCredentialsProvider</value>
      </property>
      

      but it still doesn't work.

      I suggested that it is a java-aws issue here, but they said it is not.

      Any hints on how to use public S3 files from Spark ?

      Attachments

        Activity

          People

            Unassigned Unassigned
            cipri_tom Ciprian Tomoiaga
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: