首页 \ 问答 \ 在Python中包装多行字符串（保留现有的换行符）？(Wrap multiline string (preserving existing linebreaks) in Python?)

在Python中包装多行字符串（保留现有的换行符）？(Wrap multiline string (preserving existing linebreaks) in Python?)

 考虑这个例子：  
import textwrap
import pprint

mystr=r"""
First line.
Second line.
The third line is a very long line, which I would like to somehow wrap; wrap at 80 characters - or less, or more! ... can it really be done ??"""

pprint.pprint(textwrap.wrap(mystr,80))
 
 字符串mystr已经是一个多行字符串，因为它包含换行符; 但是，如果我运行此脚本，我得到输出：  
[' First line. Second line. The third line is a very long line, which I would like',
 'to somehow wrap; wrap at 80 characters - or less, or more! ... can it really be',
 'done ??']
 
 ...这意味着textwrap.wrap首先“加入”多行字符串（即删除其中的现有换行符），然后才将其包装（即将其拆分为给定的字符数）。  
 如何包装多行字符串，以便保留换行符？ 也就是说，在这种情况下，预期输出将是：  
['First line.', 
 'Second line.', 
 'The third line is a very long line, which I would like to somehow wrap; wrap at',
 '80 characters - or less, or more! ... can it really be done ??']
 
 
 编辑; 感谢@u_mulder的评论，我试过：  
textwrap.wrap(mystr,80,replace_whitespace=False)
 
 我得到了：  
['\nFirst line.\nSecond line.\nThe third line is a very long line, which I would like',
 'to somehow wrap; wrap at 80 characters - or less, or more! ... can it really be',
 'done ??']
 
 换行似乎被保留，但作为“内联”字符; 所以这里第一个元素本身就是一个多行字符串 - 所以它不是我需要的，每一行都是一个数组元素。 

Consider this example: 
import textwrap
import pprint

mystr=r"""
First line.
Second line.
The third line is a very long line, which I would like to somehow wrap; wrap at 80 characters - or less, or more! ... can it really be done ??"""

pprint.pprint(textwrap.wrap(mystr,80))
 
The string mystr is already a multiline string, given that it contains linebreaks; however, if I run this script, I get as output: 
[' First line. Second line. The third line is a very long line, which I would like',
 'to somehow wrap; wrap at 80 characters - or less, or more! ... can it really be',
 'done ??']
 
... which means that textwrap.wrap first "joined" the multiline string (that is, removed the existing linebreaks in it), and only then wrapped it (i.e. split it at the given number of characters).  
How can I wrap a multiline string, such that the line feeds are preserved? that is, in this case, the expected output would be: 
['First line.', 
 'Second line.', 
 'The third line is a very long line, which I would like to somehow wrap; wrap at',
 '80 characters - or less, or more! ... can it really be done ??']
 
 
EDIT; thanks to comment by @u_mulder, I tried: 
textwrap.wrap(mystr,80,replace_whitespace=False)
 
and with that I get: 
['\nFirst line.\nSecond line.\nThe third line is a very long line, which I would like',
 'to somehow wrap; wrap at 80 characters - or less, or more! ... can it really be',
 'done ??']
 
The line breaks seem to be preserved, but as "inline" characters; so here the first element is a multiline string in itself -- and so it is not as I require it, with every line as an array element.

原文：https://stackoverflow.com/questions/28863889

更新时间：2020-11-23 21:11

最满意答案

 找到了。 你用'H'覆盖标签的第一个字节。 其他字节都很好。 现在找出H来自哪里......  
nextLoc = nextLoc + sizeof(sizeType) + sizeof(unsigned char)
            + idSize*sizeof(char) + value.getSize();
 
 你需要在这里再添加一个。 你有skip（sizeType），长度字节（unsigned char），id本身（idSize * sizeof（char））和值（value.getSize（）），但是你还要在id和value之间留一个字节不考虑。 这就是为什么你的标签的最后一个字节被覆盖 - 并且因为你在一个小端机器上测试导致最高字节被破坏。  
    for(int i = 0; i < *((unsigned char*)idSize); ++i){
     ...
        tbl_char_ptr = tbl_char_ptr + sizeof(char);
    ...
    }

    result_ptr = tbl_char_ptr + sizeof(char);
 
 这比idSize多一个。 

Found it. You're overwriting the first byte of your tag with an 'H'. The other bytes are fine. Now to find where that H is coming from... 
nextLoc = nextLoc + sizeof(sizeType) + sizeof(unsigned char)
            + idSize*sizeof(char) + value.getSize();
 
You need to add one more here. You have the skip (sizeType), the length byte (unsigned char), the id itself (idSize * sizeof(char)) and the value (value.getSize()), but you also leave a byte between id and value that you're not accounting for. That's why the last byte of your tag is getting overwritten - and because you're testing on a little-endian machine that results in the highest byte being corrupted. 
    for(int i = 0; i < *((unsigned char*)idSize); ++i){
     ...
        tbl_char_ptr = tbl_char_ptr + sizeof(char);
    ...
    }

    result_ptr = tbl_char_ptr + sizeof(char);
 
That's one more than idSize.

在Python中包装多行字符串（保留现有的换行符）？(Wrap multiline string (preserving existing linebreaks) in Python?)

最满意答案

相关问答

我可以防止由std :: memcpy复制对象吗？(Can I prevent object from being copied by std::memcpy?)[2022-01-20]

C memcpy相反(C memcpy in reverse)[2022-08-21]

如何提高memcpy的性能(How to increase performance of memcpy)[2023-10-17]

memcpy（）vs memmove（）(memcpy() vs memmove())[2022-05-21]

在STL中使用memcpy(Using memcpy in the STL)[2024-01-13]

了解如何使用TheUnsafe进行memcpy(Understanding how to memcpy with TheUnsafe)[2023-07-06]

什么是C ++中的memcpy等价物(What is the memcpy equivalent in C++)[2022-10-30]

堆栈在memcpy之后损坏了(stack corrupted after memcpy)[2022-07-27]

memcpy和C ++类模板 - 如何使用它？(memcpy and C++ class templates - how to use it?)[2021-11-24]

对象的C ++ memcpy副本显示已损坏(C++ memcpy copy of object appears corrupted)[2023-11-17]

相关文章

最新问答