Spark在windows运行报错-ERROR Shell Failed to locate the winutils binary in the hadoop binary path java.io

winutils

Windows binaries for Hadoop versions (built from the git commit ID used for the ASF relase)

项目地址：https://gitcode.com/gh_mirrors/wi/winutils

免费下载资源

Au-csdn

966人浏览 · 2019-08-06 19:12:43

Au-csdn · 2019-08-06 19:12:43 发布

`Spark`在windows运行报错-ERROR Shell Failed to locate the winutils binary in the hadoop binary path java.io.IOException Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

在windows的idea运行spark程序，报了如下错：

Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
19/08/06 18:53:56 INFO SparkContext: Running Spark version 2.4.3
19/08/06 18:53:56 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
19/08/06 18:53:56 ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.
	at org.apache.hadoop.util.Shell.getQualifiedBinPath(Shell.java:378)
	at org.apache.hadoop.util.Shell.getWinUtilsPath(Shell.java:393)
	at org.apache.hadoop.util.Shell.<clinit>(Shell.java:386)
	at org.apache.hadoop.util.StringUtils.<clinit>(StringUtils.java:79)
	at org.apache.hadoop.security.Groups.parseStaticMapping(Groups.java:116)
	at org.apache.hadoop.security.Groups.<init>(Groups.java:93)
	at org.apache.hadoop.security.Groups.<init>(Groups.java:73)
	at org.apache.hadoop.security.Groups.getUserToGroupsMappingService(Groups.java:293)
	at org.apache.hadoop.security.UserGroupInformation.initialize(UserGroupInformation.java:283)
	at org.apache.hadoop.security.UserGroupInformation.ensureInitialized(UserGroupInformation.java:260)
	at org.apache.hadoop.security.UserGroupInformation.loginUserFromSubject(UserGroupInformation.java:789)
	at org.apache.hadoop.security.UserGroupInformation.getLoginUser(UserGroupInformation.java:774)
	at org.apache.hadoop.security.UserGroupInformation.getCurrentUser(UserGroupInformation.java:647)
	at org.apache.spark.util.Utils$$anonfun$getCurrentUserName$1.apply(Utils.scala:2422)
	at org.apache.spark.util.Utils$$anonfun$getCurrentUserName$1.apply(Utils.scala:2422)
	at scala.Option.getOrElse(Option.scala:121)
	at org.apache.spark.util.Utils$.getCurrentUserName(Utils.scala:2422)
	at org.apache.spark.SparkContext.<init>(SparkContext.scala:293)
	at org.apache.spark.SparkContext$.getOrCreate(SparkContext.scala:2520)
	at com.au.common.Base$.main(Base.scala:17)
	at com.au.common.Base.main(Base.scala)

解决方案：指定一个winutils.exe文件即可。

下载我给的bin目录文件。
百度云链接：https://pan.baidu.com/s/1422rEurIxnMr6wPJr5G8UA
提取码：ymm2
复制这段内容后打开百度网盘手机App，操作更方便哦。
将bin放到任意目录，我这里是D:/test目录下（注意一定要在winutils.exe外套上bin，不然还是会报错）。
在代码中指定该bin目录的上一级，即test目录（实际上就是模拟hadoop），代码为：System.setProperty("hadoop.home.dir", "D:/test")。

测试代码，亲测有效（词频统计）：

import org.apache.spark.{SparkConf, SparkContext}

object Base {
  def main(args: Array[String]): Unit = {

    System.setProperty("hadoop.home.dir", "D:/test") // 加入这句代码，将下载的bin目录放到任意目录，我这里是新建的test目录

    val conf = new SparkConf().setMaster("local[*]").setAppName("wordcount")
    val sc = SparkContext.getOrCreate(conf)

    val rdd = sc.textFile("D:/markdown note/Flume学习笔记.md")
    rdd.flatMap(_.split(" ")).groupBy(x => x).mapValues(x => x.size).foreach(println)
  }
}

GitHub 加速计划 / wi / winutils

下载

Windows binaries for Hadoop versions (built from the git commit ID used for the ASF relase)

最近提交(Master分支：18 天前 )

e8089ecf - 2 年前

d4f71517 point people at cdarlint/winutils for binaries and call out the fact that we could remove the need for this entirely just to run spark on windows 6 年前

GitCode 开源社区

旨在为数千万中国开发者提供一个无缝且高效的云端环境，以支持学习、使用和贡献开源项目。

更多推荐

沁言学术 vs Grammarly：中文学术写作与语料库本地化支持的表现剖析

Grammarly是全球写作工具，语料库以英文为主，支持基本中文检查；沁言学术是本土AI平台，语料库深度本地化，针对中文学术设计。中文学术写作：Grammarly基础语法/拼写（本地化弱），沁言学术AI生成/优化（深度支持）。语料库本地化：Grammarly通用库（英文主导），沁言学术本土库（CNKI等集成）。整体：Grammarly免费版通用，付费版高级；沁言学术免费版入门，AI付费优化。表现亮