首页 \ 问答 \ Java NIO与非NIO性能(Java NIO vs Non NIO Performance)

Java NIO与非NIO性能(Java NIO vs Non NIO Performance)

 我花了相当多的时间来尝试优化文件哈希算法，以便尽可能地降低每一次性能。  
 看我之前的SO主题：  
 获取文件哈希性能/优化  
 FileChannel ByteBuffer和Hashing Files  
 确定适当的缓冲区大小  
 建议多次使用Java NIO来获得本机性能提升（通过将缓冲区保留在系统中而不是将它们引入JVM）。 但是，我的NIO代码在基准测试中运行得相当慢（使用每种算法反复散列相同的文件，以消除任何可能导致结果偏差的操作系统/驱动器“魔力”。  
 我现在有两种方法可以做同样的事情：  
 This one runs faster almost every time:  
/**
 * Gets Hash of file.
 * 
 * @param file String path + filename of file to get hash.
 * @param hashAlgo Hash algorithm to use. <br/>
 *     Supported algorithms are: <br/>
 *     MD2, MD5 <br/>
 *     SHA-1 <br/>
 *     SHA-256, SHA-384, SHA-512
 * @param BUFFER Buffer size in bytes. Recommended to stay in<br/>
 *          multiples of 2 such as 1024, 2048, <br/>
 *          4096, 8192, 16384, 32768, 65536, etc.
 * @return String value of hash. (Variable length dependent on hash algorithm used)
 * @throws IOException If file is invalid.
 * @throws HashTypeException If no supported or valid hash algorithm was found.
 */
public String getHash(String file, String hashAlgo, int BUFFER) throws IOException, HasherException {
    StringBuffer hexString = null;
    try {
        MessageDigest md = MessageDigest.getInstance(validateHashType(hashAlgo));
        FileInputStream fis = new FileInputStream(file);

        byte[] dataBytes = new byte[BUFFER];

        int nread = 0;
        while ((nread = fis.read(dataBytes)) != -1) {
            md.update(dataBytes, 0, nread);
        }
        fis.close();
        byte[] mdbytes = md.digest();

        hexString = new StringBuffer();
        for (int i = 0; i < mdbytes.length; i++) {
            hexString.append(Integer.toHexString((0xFF & mdbytes[i])));
        }

        return hexString.toString();

    } catch (NoSuchAlgorithmException | HasherException e) {
        throw new HasherException("Unsuppored Hash Algorithm.", e);
    }
}
 
 My Java NIO method that runs considerably slower most of the time:  
/**
 * Gets Hash of file using java.nio File Channels and ByteBuffer 
 * <br/>for native system calls where possible. This may improve <br/>
 * performance in some circumstances.
 * 
 * @param fileStr String path + filename of file to get hash.
 * @param hashAlgo Hash algorithm to use. <br/>
 *     Supported algorithms are: <br/>
 *     MD2, MD5 <br/>
 *     SHA-1 <br/>
 *     SHA-256, SHA-384, SHA-512
 * @param BUFFER Buffer size in bytes. Recommended to stay in<br/>
 *          multiples of 2 such as 1024, 2048, <br/>
 *          4096, 8192, 16384, 32768, 65536, etc.
 * @return String value of hash. (Variable length dependent on hash algorithm used)
 * @throws IOException If file is invalid.
 * @throws HashTypeException If no supported or valid hash algorithm was found.
 */
public String getHashNIO(String fileStr, String hashAlgo, int BUFFER) throws IOException, HasherException {

    File file = new File(fileStr);

    MessageDigest md = null;
    FileInputStream fis = null;
    FileChannel fc = null;
    ByteBuffer bbf = null;
    StringBuilder hexString = null;

    try {
        md = MessageDigest.getInstance(hashAlgo);
        fis = new FileInputStream(file);
        fc = fis.getChannel();
        bbf = ByteBuffer.allocateDirect(BUFFER); // allocation in bytes - 1024, 2048, 4096, 8192

        int b;

        b = fc.read(bbf);

        while ((b != -1) && (b != 0)) {
            bbf.flip();

            byte[] bytes = new byte[b];
            bbf.get(bytes);

            md.update(bytes, 0, b);

            bbf.clear();
            b = fc.read(bbf);
        }

        fis.close();

        byte[] mdbytes = md.digest();

        hexString = new StringBuilder();

        for (int i = 0; i < mdbytes.length; i++) {
            hexString.append(Integer.toHexString((0xFF & mdbytes[i])));
        }

        return hexString.toString();

    } catch (NoSuchAlgorithmException e) {
        throw new HasherException("Unsupported Hash Algorithm.", e);
    }
}
 
 我的想法是Java NIO尝试使用本机系统调用，以便在系统中和JVM之外保持处理和存储（缓冲区） - 这样（在理论上）这可以防止程序不断地在程序之间来回移动。 JVM和系统。 从理论上讲，这应该更快......但也许我的MessageDigest强制JVM引入缓冲区，否定了本机缓冲区/系统调用带来的任何性能改进？ 我在这个逻辑中是正确的还是我离开了？  
 Please help me understand why Java NIO is not better in this scenario. 

I've spent considerable time attempting to optimize a file hashing algorithm to eek out every last drop of performance possible. 
See my previous SO threads: 
Get File Hash Performance/Optimization 
FileChannel ByteBuffer and Hashing Files 
Determining Appropriate Buffer Size 
It was recommened several times to use Java NIO to gain native performance increases (by keeping the buffer's in the system instead of bringing them into the JVM). However, my NIO code runs considerably slower un benchmarks (hashing the same files over and over with each algorithm to negate any OS/Drive "magic" that could be skewing results. 
I now have two methods that do the same thing: 
This one runs faster almost every time: 
/**
 * Gets Hash of file.
 * 
 * @param file String path + filename of file to get hash.
 * @param hashAlgo Hash algorithm to use. <br/>
 *     Supported algorithms are: <br/>
 *     MD2, MD5 <br/>
 *     SHA-1 <br/>
 *     SHA-256, SHA-384, SHA-512
 * @param BUFFER Buffer size in bytes. Recommended to stay in<br/>
 *          multiples of 2 such as 1024, 2048, <br/>
 *          4096, 8192, 16384, 32768, 65536, etc.
 * @return String value of hash. (Variable length dependent on hash algorithm used)
 * @throws IOException If file is invalid.
 * @throws HashTypeException If no supported or valid hash algorithm was found.
 */
public String getHash(String file, String hashAlgo, int BUFFER) throws IOException, HasherException {
    StringBuffer hexString = null;
    try {
        MessageDigest md = MessageDigest.getInstance(validateHashType(hashAlgo));
        FileInputStream fis = new FileInputStream(file);

        byte[] dataBytes = new byte[BUFFER];

        int nread = 0;
        while ((nread = fis.read(dataBytes)) != -1) {
            md.update(dataBytes, 0, nread);
        }
        fis.close();
        byte[] mdbytes = md.digest();

        hexString = new StringBuffer();
        for (int i = 0; i < mdbytes.length; i++) {
            hexString.append(Integer.toHexString((0xFF & mdbytes[i])));
        }

        return hexString.toString();

    } catch (NoSuchAlgorithmException | HasherException e) {
        throw new HasherException("Unsuppored Hash Algorithm.", e);
    }
}
 
My Java NIO method that runs considerably slower most of the time: 
/**
 * Gets Hash of file using java.nio File Channels and ByteBuffer 
 * <br/>for native system calls where possible. This may improve <br/>
 * performance in some circumstances.
 * 
 * @param fileStr String path + filename of file to get hash.
 * @param hashAlgo Hash algorithm to use. <br/>
 *     Supported algorithms are: <br/>
 *     MD2, MD5 <br/>
 *     SHA-1 <br/>
 *     SHA-256, SHA-384, SHA-512
 * @param BUFFER Buffer size in bytes. Recommended to stay in<br/>
 *          multiples of 2 such as 1024, 2048, <br/>
 *          4096, 8192, 16384, 32768, 65536, etc.
 * @return String value of hash. (Variable length dependent on hash algorithm used)
 * @throws IOException If file is invalid.
 * @throws HashTypeException If no supported or valid hash algorithm was found.
 */
public String getHashNIO(String fileStr, String hashAlgo, int BUFFER) throws IOException, HasherException {

    File file = new File(fileStr);

    MessageDigest md = null;
    FileInputStream fis = null;
    FileChannel fc = null;
    ByteBuffer bbf = null;
    StringBuilder hexString = null;

    try {
        md = MessageDigest.getInstance(hashAlgo);
        fis = new FileInputStream(file);
        fc = fis.getChannel();
        bbf = ByteBuffer.allocateDirect(BUFFER); // allocation in bytes - 1024, 2048, 4096, 8192

        int b;

        b = fc.read(bbf);

        while ((b != -1) && (b != 0)) {
            bbf.flip();

            byte[] bytes = new byte[b];
            bbf.get(bytes);

            md.update(bytes, 0, b);

            bbf.clear();
            b = fc.read(bbf);
        }

        fis.close();

        byte[] mdbytes = md.digest();

        hexString = new StringBuilder();

        for (int i = 0; i < mdbytes.length; i++) {
            hexString.append(Integer.toHexString((0xFF & mdbytes[i])));
        }

        return hexString.toString();

    } catch (NoSuchAlgorithmException e) {
        throw new HasherException("Unsupported Hash Algorithm.", e);
    }
}
 
My thoughts are that Java NIO attempts to use native system calls and such to keep processing and storage (buffers) in the system and out of the JVM - this prevents (in theory) the program from having to constantly shuffle things back and forth between the JVM and the system. In theory this should be faster... but perhaps my MessageDigest forces the JVM to bring the buffer in, negating any performance improvements the native buffers/system calls can bring? Am I correct in this logic or am I way off?  
Please help me understand why Java NIO is not better in this scenario.

原文：https://stackoverflow.com/questions/16321299

更新时间：2023-06-26 18:06

最满意答案

 请更改FilterExpression ，如下所述。  
FilterExpression="attribute_not_exists(age) AND attribute_not_exists(address)",

Please change the FilterExpression as mentioned below. 
FilterExpression="attribute_not_exists(age) AND attribute_not_exists(address)",

Java NIO与非NIO性能(Java NIO vs Non NIO Performance)

最满意答案

相关问答

使用boto3 dynamodb客户端的FilterExpression语法错误(FilterExpression Syntax error using boto3 dynamodb client)[2023-10-26]

AWS boto3 create_policy（） - 指定policyDocument(AWS boto3 create_policy() - specify policyDocument)[2022-06-04]

Boto3不会将快照复制到其他区域，其他选项？(Boto3 not copying snapshot to other regions, other options?)[2022-04-27]

使用boto3获取aws帐户根用户详细信息时出错(Error on getting aws account root user details using boto3)[2023-06-14]

boto3 dynamodb IN条件(boto3 dynamodb IN condition)[2022-09-11]

迭代通过ec2描述实例boto3(Iterate thru ec2 describe instance boto3)[2022-03-04]

boto3错误无法获取实例的标签(boto3 error unable to get tags for instances)[2022-06-30]

与DynamoDB库Boto3的事务(Transactions with DynamoDB library Boto3)[2022-04-05]

为DynamoDB.Table等boto3资源键入注释(Type annotation for boto3 resources like DynamoDB.Table)[2022-10-13]

AWS DynamoDB Python - 无法识别boto3 Key（）方法（查询）(AWS DynamoDB Python - boto3 Key() methods not recognized (Query))[2021-12-31]

相关文章

最新问答