PDFJS and PDF encoding

We are implementing PDFJS to render PDF files on a website.
When we try to open a PDF document in the viewer from an ArrayBuffer, we get all sorts of errors and the file is not rendered. When we open the same file in the viewer from a URL (the DEFAULT_URL variable), it renders fine.
Some files, however, do render when passed in this way. Comparing the files in Notepad shows that they contain different encodings/characters.
This piece of code is used to open the file in the viewer:
function rawStringToBuffer( str ) {
    var idx, len = str.length, arr = new Array( len );
    for ( idx = 0 ; idx < len ; ++idx ) {
        arr[ idx ] = str.charCodeAt(idx) & 0xFF;
    }
    return new Uint8Array( arr ).buffer;
}

function readSingleFile(e) {
  var file = e.target.files[0];
  if (!file) {
    return;
  }
  var reader = new FileReader();
  reader.onload = function(e) {
    var contents = e.target.result;
    var uint8array = rawStringToBuffer(contents);
    pdfjsframe.contentWindow.PDFViewerApplication.open(uint8array, 0);
  };
  reader.readAsText(file);
}
 
test.pdf: a hello-world PDF that is not rendered with the code above.

test2.pdf: a hello-world PDF that is rendered with the code above.

The behaviour is not browser dependent. The build is b15f335.
Is there something in the code or in the default configuration of the viewer that prevents test.pdf from being rendered?

Source: https://stackoverflow.com/questions/37673583

Updated: 2022-12-15 08:12

Best answer

I'm not sure how to do an in-place transpose of arbitrary matrices efficiently with SIMD, but I do know how to do it out of place. Let me describe both approaches.
In-place transpose
For an in-place transpose, see Agner Fog's manual Optimizing software in C++, section 9.10 "Cache contentions in large data structures", Example 9.5a. For certain matrix sizes you will see a large drop in performance due to cache aliasing. See Table 9.1 for examples, and the question "Why is transposing a matrix of 512x512 much slower than transposing a matrix of 513x513?". Agner gives a way to fix this using loop tiling (similar to what Paul R described) in Example 9.5b.
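To make the loop-tiling idea concrete, here is a minimal sketch of a tiled in-place transpose for a square row-major matrix. It is not Agner Fog's Example 9.5b, only an illustration of the same general technique; the function name transpose_inplace_tiled and the tile parameter are hypothetical, and the sketch uses plain scalar swaps rather than SIMD.

```cpp
#include <algorithm>
#include <cstddef>
#include <utility>

// In-place transpose of a square n x n row-major matrix (hypothetical helper).
// The matrix is walked in tile x tile blocks so that the row-wise and
// column-wise accesses of one block stay within a small working set.
// Assumes tile > 0.
void transpose_inplace_tiled(float* a, std::size_t n, std::size_t tile) {
    for (std::size_t ib = 0; ib < n; ib += tile) {
        // Visit only tiles on or above the diagonal; each swap also
        // updates the mirrored element in the tile below the diagonal.
        for (std::size_t jb = ib; jb < n; jb += tile) {
            const std::size_t i_end = std::min(ib + tile, n);
            const std::size_t j_end = std::min(jb + tile, n);
            for (std::size_t i = ib; i < i_end; ++i) {
                // Inside a diagonal tile, start just past the diagonal so
                // that no pair of elements is swapped twice.
                const std::size_t j_start = (ib == jb) ? i + 1 : jb;
                for (std::size_t j = j_start; j < j_end; ++j) {
                    std::swap(a[i * n + j], a[j * n + i]);
                }
            }
        }
    }
}
```

A call such as transpose_inplace_tiled(data, n, 16) would transpose an n x n matrix stored in data; the best tile size depends on the cache and has to be measured.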
Out-of-place transpose
See my answer (the one with the most votes) to "What is the fastest way to transpose a matrix in C++?". I have not looked into this in ages, but let me repeat my code here:
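#include <xmmintrin.h>  // SSE: __m128, _mm_load_ps, _mm_store_ps, _MM_TRANSPOSE4_PS

// Transposes one 4x4 block of floats from A (row stride lda) into B (row stride ldb).
// _mm_load_ps and _mm_store_ps require every row address to be 16-byte aligned.
inline void transpose4x4_SSE(float *A, float *B, const int lda, const int ldb) {
    __m128 row1 = _mm_load_ps(&A[0*lda]);
    __m128 row2 = _mm_load_ps(&A[1*lda]);
    __m128 row3 = _mm_load_ps(&A[2*lda]);
    __m128 row4 = _mm_load_ps(&A[3*lda]);
    _MM_TRANSPOSE4_PS(row1, row2, row3, row4);  // transposes the four registers in place
    _mm_store_ps(&B[0*ldb], row1);
    _mm_store_ps(&B[1*ldb], row2);
    _mm_store_ps(&B[2*ldb], row3);
    _mm_store_ps(&B[3*ldb], row4);
}

// Transposes the n x m matrix A into the m x n matrix B, one block_size x block_size tile at a time.
// n, m, and block_size must be multiples of 4; otherwise the 4x4 kernel runs past the matrix edge.
// The OpenMP pragma only takes effect when OpenMP is enabled in the compiler.
inline void transpose_block_SSE4x4(float *A, float *B, const int n, const int m, const int lda, const int ldb, const int block_size) {
    #pragma omp parallel for
    for(int i=0; i<n; i+=block_size) {
        for(int j=0; j<m; j+=block_size) {
            // Clamp the tile to the matrix bounds.
            int max_i2 = i+block_size < n ? i + block_size : n;
            int max_j2 = j+block_size < m ? j + block_size : m;
            for(int i2=i; i2<max_i2; i2+=4) {
                for(int j2=j; j2<max_j2; j2+=4) {
                    transpose4x4_SSE(&A[i2*lda +j2], &B[j2*ldb + i2], lda, ldb);
                }
            }
        }
    }
}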
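The SSE routine above works on whole 4x4 blocks, so it is useful to have a scalar version with the same indexing to check results against or to handle matrices whose dimensions are not multiples of 4. The following is a minimal sketch under that assumption; the name transpose_block_scalar is hypothetical and is not part of the original answer.

```cpp
#include <algorithm>

// Scalar out-of-place transpose with the same tiling and indexing as the
// SSE version: A is n x m with row stride lda, B is m x n with row stride ldb.
// Works for any n and m and has no alignment requirement (hypothetical helper).
void transpose_block_scalar(const float* A, float* B, int n, int m,
                            int lda, int ldb, int block_size) {
    for (int i = 0; i < n; i += block_size) {
        for (int j = 0; j < m; j += block_size) {
            // Clamp the tile to the matrix bounds.
            const int max_i2 = std::min(i + block_size, n);
            const int max_j2 = std::min(j + block_size, m);
            for (int i2 = i; i2 < max_i2; ++i2) {
                for (int j2 = j; j2 < max_j2; ++j2) {
                    // Element (i2, j2) of A becomes element (j2, i2) of B.
                    B[j2 * ldb + i2] = A[i2 * lda + j2];
                }
            }
        }
    }
}
```

Running both versions on the same input and comparing B element by element is a simple way to validate the SSE path.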
