当前位置: 首页 > article >正文

hadoop3.x 新特性

hadoop3.x 新特性

FeaturesHadoop 2.xHadoop 3.x
Minimum Required Java VersionJDK 6 and above.JDK 8 is the minimum runtime version of JAVA required to run Hadoop 3.x as many dependency library files have been used from JDK 8.
Fault ToleranceFault Tolerance is handled through replication leading to storage and network bandwidth overhead.(3个副本)Support for Erasure Coding(纠错码) in HDFS improves fault tolerance (0.5纠错码+1数据 = 1.5倍磁盘占用)
Storage SchemeFollows a 3x Replication Scheme for data recovery leading to 200% storage overhead. For instance, if there are 8 data blocks then a total of 24 blocks will occupy the storage space because of the 3x replication scheme.Storage overhead in Hadoop 3.0 is reduced to 50% with support for Erasure Coding. In this case, if here are 8 data blocks then a total of only 12 blocks will occupy the storage space.
Change in Port NumbersHadoop HDFS NameNode -8020 Hadoop HDFS DataNode -50010 Secondary NameNode HTTP -50091Hadoop HDFS NameNode -9820 Hadoop HDFS DataNode -9866 Secondary NameNode HTTP -9869
YARN Timeline ServiceYARN timeline service introduced in Hadoop 2.0 has some scalability issues.YARN Timeline service has been enhanced with ATS v2 which improves the scalability and reliability.
Intra DataNode BalancingHDFS Balancer in Hadoop 2.0 caused skew within a DataNode because of addition or replacement of disks.Intra DataNode Balancing has been introduced in Hadoop 3.0 to address the intra-DataNode skews which occur when disks are added or replaced.
Number of NameNodesHadoop 2.0 introduced a secondary namenode as standby.(一主一备)Hadoop 3.0 supports 2 or more NameNodes.(一主多备)
Heap SizeIn Hadoop 2.0 , for Java and Hadoop tasks, the heap size needs to be set through two similar properties mapreduce.{map,reduce}.java. Opts and mapreduce.{map,reduce}.memory.mbIn Hadoop 3.0, heap size or mapreduce.*.memory.mb is derived automatically.
hdfs HA 逻辑
  1. 增加用于主备之间信息共享推送的 JournalNode
    JournalNode 是 hadoop 根据 paxos 协议实现的日志服务
  2. 增加用于选主决策的 zookeeper 集群:ha.zookeeper.quorum 配置
  3. 增加用于监控同机器上的 namenode,试图选举,切换本地 namenode 的 active,standby 状态的zookeeper failover controller(zkfc)进程:QuorumPeerMain

http://www.kler.cn/a/400096.html

相关文章:

  • JavaWeb——JS、Vue
  • ssm129办公用品管理系统开发与设计+jsp(论文+源码)_kaic
  • 如何使用正则表达式验证域名
  • 深度学习transformer
  • 技术理论||02空中三角测量
  • C++学习-空指针推荐使用nullptr
  • 学习threejs,使用TWEEN插件实现动画
  • 利用正则表达式批量修改文件名
  • Python读取prophesee相机输出的raw文件
  • java itext后端生成pdf导出
  • 企业架构框架之银行业参考架构BIAN
  • 数据分析-50-时间序列信息编码之采用正余弦循环编码
  • kafka-clients之max.block.ms
  • 【时间之外】IT人求职和创业应知【37】-AIGC私有化
  • 关于GCC内联汇编(也可以叫内嵌汇编)的简单学习
  • 基于GPU器件行为的创新分布式功能安全机制为智能驾驶保驾护航
  • 2. kafka 生产者
  • 【python】使用 DrissionPage 库进行网页自动化操作和数据提取
  • 【云原生后端开发流程及详细教程】
  • IDEA 开发工具常用快捷键有哪些?
  • zookeeper安装教程
  • openwebui使用
  • node.js 入门级基础应用
  • 【Java 集合】Collections 空列表细节处理
  • Spark_写ORALCE:ORA-01426 numeric overflow 问题解决
  • 在 Qt 中使用 OpenGL 详解