Unrecognized Hadoop major version number: 3.1.1

This is the error I’m getting when attempting to connect to a Cloudera cluster:

“Database name: Unrecognized Hadoop major version number: 3.1.1”

Looking through the SparkSQL driver jar, I see this version dependency in many places, for example in commons-math3/pom.xml (see below).

I’m running Hadoop 2.6. Can I conclude from this that the Metabase SparkSQL driver is incompatible with a Hadoop 2.6 installation?

Thanks.

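Excerpt from the pom.xml (the XML tags were stripped when posting; element names are restored from the standard Maven layout):

    <modelVersion>4.0.0</modelVersion>
    <parent>
      <groupId>org.apache.hadoop</groupId>
      <artifactId>hadoop-project-dist</artifactId>
      <version>3.1.1</version>
      <relativePath>…/…/hadoop-project-dist</relativePath>
    </parent>
    <artifactId>hadoop-common</artifactId>
    <version>3.1.1</version>
    <description>Apache Hadoop Common</description>
    <name>Apache Hadoop Common</name>
    <packaging>jar</packaging>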
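
Rather than reading each bundled pom.xml by hand, the Hadoop artifacts recorded inside the jar can be listed with a short script. This is only a sketch: it assumes the jar still contains the usual META-INF/maven/<groupId>/<artifactId>/pom.properties entries (an uberjar may drop or overwrite them), and the function name `bundled_versions` is made up for illustration.

```python
import zipfile


def bundled_versions(jar_path, group_prefix="org.apache.hadoop"):
    """Return {artifactId: version} for Maven artifacts recorded inside a jar.

    Assumption: the jar keeps META-INF/maven/<groupId>/<artifactId>/pom.properties
    entries. Artifacts without such an entry are simply not reported.
    """
    found = {}
    with zipfile.ZipFile(jar_path) as jar:
        for name in jar.namelist():
            parts = name.split("/")
            # Expected layout: META-INF/maven/<groupId>/<artifactId>/pom.properties
            if len(parts) != 5 or parts[:2] != ["META-INF", "maven"]:
                continue
            if parts[4] != "pom.properties" or not parts[2].startswith(group_prefix):
                continue
            props = {}
            for line in jar.read(name).decode("utf-8", "replace").splitlines():
                line = line.strip()
                # Skip blank lines and the generated "#..." comment lines.
                if line and not line.startswith("#") and "=" in line:
                    key, value = line.split("=", 1)
                    props[key.strip()] = value.strip()
            # Fall back to the directory name if artifactId is missing.
            found[props.get("artifactId", parts[3])] = props.get("version", "unknown")
    return found
```

Calling `bundled_versions("path/to/sparksql.metabase-driver.jar")` (the path is a placeholder) returns a dictionary of Hadoop artifact names and the versions the jar was built against, which can then be compared with the version running on the cluster.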

Hi @stiehm

I have zero knowledge about Hadoop, but it seems like you should use the same versions for best compatibility:
https://hadoop.apache.org/docs/r3.1.1/hadoop-project-dist/hadoop-common/AdminCompatibilityGuide.html

You might need to change the dependency versions in the driver’s project.clj and build your own driver (./bin/build-driver.sh sparksql):
https://github.com/metabase/metabase/blob/master/modules/drivers/sparksql/project.clj

I can find a couple of mentions of Cloudera in this issue. Note that the comments and information there are fairly old:
https://github.com/metabase/metabase/issues/7528
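
For what it’s worth, the wording of the message matches the Hadoop version check in Hive’s ShimLoader, which looks at the Hadoop client libraries on the classpath rather than at the cluster. If that is where it comes from, the 3.1.1 in the message would be the version bundled in the driver jar, not the 2.6 on the cluster. Below is a simplified Python sketch of that kind of check. The function name and the `recognized` default are assumptions for illustration; which major versions are accepted depends on the Hive release bundled in the driver.

```python
def hadoop_major_version(version, recognized=(2,)):
    """Return the major version, or raise in the style of the reported error.

    `recognized` is an assumption: the set of accepted major versions
    depends on the Hive client bundled with the driver.
    """
    parts = version.split(".")
    if len(parts) < 2:
        raise ValueError(f"Illegal Hadoop Version: {version} (expected A.B.* format)")
    major = int(parts[0])
    if major not in recognized:
        # Same wording as the error shown in the connection dialog.
        raise ValueError(f"Unrecognized Hadoop major version number: {version}")
    return major
```

With these assumptions, `hadoop_major_version("2.6.0")` returns 2, while `hadoop_major_version("3.1.1")` raises an error with the same text as in the original report. That would be consistent with rebuilding the driver against Hadoop 2.x client libraries as a possible fix.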