Hadoop and MapReduce

I am new to HDFS and MapReduce and am trying to calculate survey statistics. The input file is in this format: Age Points Sex Category (all four are numbers). Is this the correct start?

    public static class MapClass extends MapReduceBase
            implements Mapper<IntWritable, IntWritable, IntWritable, IntWritable> {
        private final static IntWritable Age = new IntWritable(1);
        private IntWritable AgeCount = new IntWritable();

        public void map(Text key, Text value,
                        OutputCollector<IntWritable, IntWritable> output,
                        Reporter reporter) throws IOException {
            AgeCount.set(Integer.parseInt(value.toString()));
            output.collect(AgeCount, Age);
        }
    }

My questions:

1. Is this a correct start?
2. If I want to collect other attributes such as Sex and Points, do I just add more output.collect statements? I know I have to read the line and split it into attributes.
3. Where it says implements Mapper, I made all four type parameters IntWritable. Is that correct?
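
For question 2, the "read the line and split into attributes" step might look roughly like the sketch below. It is plain Java rather than Hadoop code: `SurveyMapSketch` and `mapLine` are hypothetical names, the returned list stands in for repeated `output.collect(key, one)` calls, and whitespace-separated fields are an assumption about the input file. Prefixing each key with the attribute name is one possible way to keep counts for different attributes apart in the reducer. On question 3, the four type parameters of `Mapper` have to agree with the `map` signature, and with the default text input format in the old `org.apache.hadoop.mapred` API the input key is a `LongWritable` offset and the input value is a `Text` line, so four `IntWritable`s would likely not match.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class SurveyMapSketch {
    // Field names in input order, taken from the format described in the question.
    private static final String[] FIELDS = {"Age", "Points", "Sex", "Category"};

    // Stand-in for Mapper.map(): turns one input line into (key, 1) pairs,
    // one pair per attribute. Each add() corresponds to one output.collect() call.
    static List<Map.Entry<String, Integer>> mapLine(String line) {
        // Assumption: fields are separated by whitespace.
        String[] parts = line.trim().split("\\s+");
        if (parts.length != FIELDS.length) {
            throw new IllegalArgumentException("expected 4 fields: " + line);
        }
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (int i = 0; i < FIELDS.length; i++) {
            int value = Integer.parseInt(parts[i]); // all four fields are numeric
            out.add(new SimpleEntry<>(FIELDS[i] + ":" + value, 1));
        }
        return out;
    }

    public static void main(String[] args) {
        // Print the pairs emitted for one sample record.
        for (Map.Entry<String, Integer> e : mapLine("25 10 1 3")) {
            System.out.println(e.getKey() + "\t" + e.getValue());
        }
    }
}
```

In a real job the keys would be `Text` and the values `IntWritable`, and a reducer would sum the ones for each key to get per-attribute counts.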

Original: https://stackoverflow.com/questions/5698693

Updated: 2023-06-28 16:06

Best answer

If it is a file, you need to read it.
So use read.zoo() as you did, but then convert right away:

    gold <- as.xts(read.zoo("GOLD.CSV", sep=",", format="%m/%d/%Y", header=TRUE))

OK?
