Hadoop - MapReduce（5）

论坛元老

Rank: 8 Rank: 8

UID: 1066743

1^#

打印

字体大小: tT

look_w发表于 2019-1-16 19:39 | 只看该作者

Hadoop - MapReduce（5）

MapReduce - 编程在线练习处理

select：直接分析输入数据，取出需要的字段数据即可
where: 也是对输入数据处理的过程中进行处理，判断是否需要该数据
aggregation:min, max, sum
group by: 通过Reducer实现
sort
join: map join, reduce join

Third-Party Librariesexport LIBJARS=M
Y
LI
B/commons−lang
−2.3.jar,<other
j
ars
u
sed

b
y

r
emote
c
omponents
>had
oopjarprohad
oop−0.0.1−S
N
AP
S
H
OT
.jarorg
.aspress.prohad
oop
.c3.W
ord
C
ountU
sing
T
oolRunner−libjars

LIBJARS<input_path><output_path>
hadoop jar prohadoop-0.0.1-SNAPSHOT-jar-with-dependencies.jar org.aspress.prohadoop.c3. WordCountUsingToolRunner <input_path>The dependent libraries are now included inside the application JAR file
一般还是上面的好，指定依赖可以利用Public Cache，如果是包含依赖，则每次都需要拷贝

收藏分享评分

回复引用

订阅 TOP

返回列表