相信很多人学习hadoop都是从hadoop权威指南开始的，但权威指南使用的hadoop版本是0.19版本的，而有部分人（其中包括我）使用的0.20版本的。相信大家都知道0.20版本相对于0.19版本有了重大的改变。提供了一系列新的API。具体哪些我这里就不具体说了。其中一个跟测试、调试密切相关的就是在0.20版本出现了Context object（上下文对象）.所以本篇日志就记录一下我在0.20版本下的测试、调试程序。这里有要特别提示下，这些方法都是我自己摸索的，不敢保证一定效果最好或者最简洁，比如计数器那个我也见过其他实现方法。所以如果有错请大家指出。先谢谢了。

先来说说测试，老规矩直接上代码，注释在代码里：

public class TestMapper {
@Test
public void processReduce() throws IOException{
wordcountReduce reuder = new wordcountReduce(); //reduce测试
LongWritable key = new LongWritable(1234);
List<LongWritable> list = new ArrayList<LongWritable>();
list.add(new LongWritable(10));
list.add(new LongWritable(2));
Iterable<LongWritable> values = list; //构造Iterable
//OutputCollector<LongWritable, LongWritable> output = mock(OutputCollector.class); 老版本测试
Reducer.Context context = mock(Reducer.Context.class); //这里要注明Reduce.context上下文对象
try {
//reude.reduce(key, values, output, null); 老版本测试
reuder.reduce(key, values, context); //使用上下文对象代替上面的output
verify(context).write(new LongWritable(12), new LongWritable(1234));
}catch (InterruptedException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}
}
@Test
public void processMap()throws IOException{
wordcountMapper mapper = new wordcountMapper(); //map测试
Text value = new Text("1234");
LongWritable key = new LongWritable(1);
//OutputCollector<LongWritable, Text> output = mock(OutputCollector.class); 老版本测试
Mapper.Context context = mock(Mapper.Context.class);
try{
//mapper.map(key, value, output, null); 老版本测试
mapper.map(key, value, context);
verify(context).write(new LongWritable(1234), new LongWritable(1));
} catch (InterruptedException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}
}
public static class wordcountMapper extends
Mapper<LongWritable, Text, LongWritable, LongWritable>{
public void map(LongWritable key, Text value, Context context)throws IOException, InterruptedException{
String one = value.toString();
context.write(new LongWritable(Integer.parseInt(one)) , key);
}
}
public class wordcountReduce extends
Reducer<LongWritable, LongWritable, LongWritable, LongWritable>{
public void reduce(LongWritable key, Iterable<LongWritable>values, Context context)throws IOException, InterruptedException{
int sum = 0;
for (LongWritable str : values){
sum += str.get();
}
context.write(new LongWritable(sum), key );
}
}
}

相关问答

手机调试Android程序出异常时不打印堆栈信息[2021-10-17]

打印堆栈是调试的常用方法，一般在系统异常时，我们可以将异常情况下的堆栈打印出来，这样十分方便错误查找。实际上还有另外一个非常有用的功能：分析代码的行为。android代码太过庞大复杂了，完全的静态分析经常是无从下手，因此通过打印堆栈的动态分析也十分必要。 Android打印堆栈的方法，简单归类一下 1. zygote的堆栈dump 实际上这个可以同时dump java线程及native线程的堆栈，对于java线程，java堆栈和native堆栈都可以得到。使用方法很简单，直接在adb shell或串口中输 ...
如何在win7下的eclipse中调试Hadoop2.2.0的程序[2022-07-06]

privatestaticStringcheckHadoopHome(){//firstchecktheDflaghadoop.home.dirwithJVMscope//System.setProperty("hadoop.home.dir","");Stringhome=System.getProperty("hadoop.home.dir");//fallbacktothesystem/user-globalenvvariableif(home==null){home=System.getenv("H ...
hadoop的MapReduce程序运行操作问题[2022-03-24]

都可以，简单的直接用txt打开java文件，写好后打包成class文件，就可以运行了。你看他原来在哪里放class文件的，你就放在那里
基准测试Hadoop Map-Reduce应用程序(Benchmarking Hadoop Map-Reduce application)[2023-10-26]

JobTracker Web UI为您提供了非常有用的报告，可以比较每个映射器和reducer的可用日志。另请查看hadoop-test.jar存档中的mrbench类。网上有大量有关Hadoop集群基准测试用法的信息，如本文所述。 JobTracker web UI gives you very useful reports which allow to compare everything up to available logs for every mapper and reducer. Als ...
如何使用log调试mapreduce（hadoop-2.5.1）程序？(How to debug mapreduce (hadoop-2.5.1) programs using log or Eclipse-hdt?)[2022-07-11]

使用ResourceManager的UI ：如果您在群集中运行YARN，那么您的群集中将不会运行JobTracker，因此您无法访问http://localhost:50030/jobtracker.jsp ，而是会运行ResourceManager并且您可以访问ResourceManager的Web页面访问http://RESOURCE_MANAGER_HOST:8088/cluster （用您的ResourceManager的IP地址替换RESOURCE_MANAGER_HOST）。在Resourc ...
Visual Studio测试主机不加载调试信息(Visual Studio test host does not load debug information)[2022-07-20]

原来这条消息可以忽略不计。虽然主机进程确实没有调试信息，但应正确加载测试项目的PDB文件，以便测试代码中的断点将被命中。 Turns out this message can be ignored. While the host process is indeed without debug information, the test project's PDB file should be loaded correctly, so that breakpoints in the test code wi ...
在Windows中的eclipse中调试hadoop Wordcount程序(Debugging hadoop Wordcount program in eclipse in windows)[2022-04-11]

我怀疑Hadoop是否安装正确。如果所有守护程序都在运行，请检查您的机器。如果没有，请考虑重新检查或重新安装您缺少的内容。 ERROR [main] util.Shell (Shell.java:getWinUtilsPath(373)) - Failed to locate the winutils binary in the hadoop binary path java.io.IOException: Could not locate executable I doubt if Hadoop ...
任何类似于Apache Hadoop的测试框架/解决方案？(Any tested Frameworks/Solutions similar to Apache Hadoop?)[2022-12-14]

也许。但是他们中没有一个会在测试中接近hadoop的真实世界体验。像Facebook和雅虎这样的公司正在付钱来规模hadoop，我也知道没有类似的开源项目值得期待。 Maybe. But none of them will have anywhere near the testing a real world experience that hadoop does. Companies like facebook and yahoo are paying to scale hadoop and I kn ...
Hadoop Resource Manager存储应用程序信息多长时间？(How long does the Hadoop Resource Manager store the application information?)[2023-10-02]

您应该检查mapred-site.xml并查看mapreduce.jobhistory.max-age-ms 。如： https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/mapred-default.xml 运行历史记录清理程序时，将删除早于此毫秒的作业历史记录文件。默认为604800000（1周）。如果要读取资源使用情况，则应考虑使用作业历史记录服务器的Job API和 ...
使用Eclipse开发，测试和调试Hadoop map / reduce作业(Developing, testing and debugging Hadoop map/reduce jobs with Eclipse)[2022-09-10]

我通过以下方式在Eclipse中开发Cassandra / Hadoop应用程序：使用maven（m2e）为我的Eclipse项目收集和配置依赖项（Hadoop，Cassandra，Pig等）创建测试用例（src / test / java中的类）来测试我的映射器和缩减器。诀窍是使用扩展RecordWriter和StatusReporter的内部类动态构建上下文对象。如果执行此操作，则在调用setup / map / cleanup或setup / reduce / cleanup之后，您可以断言正 ...

知识点

相关文章

最近更新

Hadoop下的程序测试及调试信息

相关问答

手机调试Android程序出异常时不打印堆栈信息[2021-10-17]

如何在win7下的eclipse中调试Hadoop2.2.0的程序[2022-07-06]

hadoop的MapReduce程序运行操作问题[2022-03-24]

基准测试Hadoop Map-Reduce应用程序(Benchmarking Hadoop Map-Reduce application)[2023-10-26]

如何使用log调试mapreduce（hadoop-2.5.1）程序？(How to debug mapreduce (hadoop-2.5.1) programs using log or Eclipse-hdt?)[2022-07-11]

Visual Studio测试主机不加载调试信息(Visual Studio test host does not load debug information)[2022-07-20]

在Windows中的eclipse中调试hadoop Wordcount程序(Debugging hadoop Wordcount program in eclipse in windows)[2022-04-11]

任何类似于Apache Hadoop的测试框架/解决方案？(Any tested Frameworks/Solutions similar to Apache Hadoop?)[2022-12-14]

Hadoop Resource Manager存储应用程序信息多长时间？(How long does the Hadoop Resource Manager store the application information?)[2023-10-02]

使用Eclipse开发，测试和调试Hadoop map / reduce作业(Developing, testing and debugging Hadoop map/reduce jobs with Eclipse)[2022-09-10]