Reading a txt file into a data frame in R

I have a text file in the following format:

   "C1","name1","type1": 2
   "C1","name2","type4": 6
   "C2","name1","type2": 1
   "C1","name3","type1": 10

I tried:

   db<- read.table("myfile.txt")

but this stores the file as two columns and stores the values as "name1". I also tried:

  db<- read.csv("myfile.txt", header= FALSE)

but this stores the last two columns as one column:

    C1   name1     type1:2
    C1   name2     type4:6
    C2   name1     type2:1
    C1   name3     type1:10

How can I store the last two columns as two separate columns?

    C1   name1     type1  2
    C1   name2     type4  6
    C2   name1     type2  1
    C1   name3     type1  10

thanks


Source: https://stackoverflow.com/questions/47595260
Updated: 2022-02-24 15:02

Best answer

As far as I can tell, there is no direct way to identify whether a font can be embedded. I did a quick search, and I don't think it is possible other than by catching the exception, as Erik mentioned in the comments.

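 import java.io.IOException;
 import java.util.ArrayList;
 import java.util.List;
 import com.lowagie.text.DocumentException;
 import com.lowagie.text.pdf.BaseFont;

 class EmbeddableFonts
 {
    // 1) allAvailableFonts: paths of all available font files
    // 2) returns the second list: the fonts that can be embedded
    static List<String> find( List<String> allAvailableFonts ) throws IOException
    {
       List<String> embeddableFonts = new ArrayList<String>();

       // Iterate through every available font in allAvailableFonts
       for( String font : allAvailableFonts )
       {
          boolean isEmbeddable = true;
          try
          {
             // try to embed the font (assumes iText 2.x, the com.lowagie package)
             BaseFont.createFont( font, BaseFont.WINANSI, BaseFont.EMBEDDED );
          }
          catch( DocumentException de )
          {
             // this font cannot be embedded
             isEmbeddable = false;
          }

          if( isEmbeddable )
          {
             // add to list of embeddable fonts
             embeddableFonts.add( font );
          }
       }
       return embeddableFonts;
    }
 }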

You could probably go really hardcore and make native calls to the Windows APIs to get the same result, but I think it's too much work for a task as simple as this.

I did some research and found out how iText throws this exception.

The code that throws this exception can be found at http://kickjava.com/src/com/lowagie/text/pdf/TrueTypeFont.java.htm (lines 367 and 368):

if (!justNames && embedded && os_2.fsType == 2)
     throw new DocumentException(fileName + style + " cannot be embedded due to licensing restrictions.");

The interesting part to note is the condition os_2.fsType == 2.

os_2 is an instance of WindowsMetrics; see line 174 at http://kickjava.com/src/com/lowagie/text/pdf/TrueTypeFont.java.htm

I searched Google for WindowsMetrics, and this is what I found.

This page explains that the fsType field holds the information on whether a font can be embedded: http://www.microsoft.com/typography/otspec/os2ver3.htm#fst
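
As a rough illustration of what the fsType value encodes, here is a minimal sketch that decodes the embedding bits described for the OS/2 table in the OpenType specification. The class and method names (FsTypeFlags, rejectedByIText, describe) are made up for this example and are not part of iText; rejectedByIText only mirrors the fsType == 2 condition quoted above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of iText.
final class FsTypeFlags {
    static final int RESTRICTED_LICENSE = 0x0002; // font must not be embedded
    static final int PREVIEW_AND_PRINT  = 0x0004; // read-only embedding
    static final int EDITABLE           = 0x0008; // editable embedding
    static final int NO_SUBSETTING      = 0x0100; // font may not be subsetted
    static final int BITMAP_ONLY        = 0x0200; // only bitmaps may be embedded

    // Mirrors the iText condition: the font is rejected only when fsType is exactly 2.
    static boolean rejectedByIText(int fsType) {
        return fsType == RESTRICTED_LICENSE;
    }

    // Lists the embedding permissions and restrictions encoded in fsType.
    static List<String> describe(int fsType) {
        List<String> flags = new ArrayList<>();
        if ((fsType & 0x000F) == 0) flags.add("installable embedding");
        if ((fsType & RESTRICTED_LICENSE) != 0) flags.add("restricted license embedding");
        if ((fsType & PREVIEW_AND_PRINT) != 0) flags.add("preview and print embedding");
        if ((fsType & EDITABLE) != 0) flags.add("editable embedding");
        if ((fsType & NO_SUBSETTING) != 0) flags.add("no subsetting");
        if ((fsType & BITMAP_ONLY) != 0) flags.add("bitmap embedding only");
        return flags;
    }

    public static void main(String[] args) {
        System.out.println(describe(2) + " rejected: " + rejectedByIText(2));
        System.out.println(describe(0x0104) + " rejected: " + rejectedByIText(0x0104));
    }
}
```

Note that a strict equality test like the one in iText does not catch a value such as 0x0102, where the restricted bit is combined with another flag; whether that matters depends on the fonts you have.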

The Java equivalent of WindowsMetrics, as used in iText: http://www.docjar.org/docs/api/com/lowagie/text/pdf/TrueTypeFont.WindowsMetrics.html
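
If you would rather not rely on the exception, the same field could in principle be read straight from the font file. The sketch below assumes a single-font TrueType/OpenType file (not a .ttc collection or a WOFF file) and does no validation of the input; Os2Reader and readFsType are hypothetical names, not iText API. It walks the table directory, finds the OS/2 table, and reads the 16-bit fsType field at byte offset 8 of that table.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper for illustration only; not part of iText.
final class Os2Reader {
    private static final int TAG_OS2 = 0x4F532F32; // the ASCII bytes 'O', 'S', '/', '2'

    // Returns fsType from the OS/2 table, or -1 if the font has no OS/2 table.
    static int readFsType(byte[] font) {
        ByteBuffer buf = ByteBuffer.wrap(font); // ByteBuffer is big-endian by default, like the font format
        int numTables = buf.getShort(4) & 0xFFFF;
        for (int i = 0; i < numTables; i++) {
            int record = 12 + 16 * i; // 12-byte header, then 16-byte table records
            if (buf.getInt(record) == TAG_OS2) {
                int tableOffset = buf.getInt(record + 8); // record layout: tag, checksum, offset, length
                return buf.getShort(tableOffset + 8) & 0xFFFF; // fsType follows version, xAvgCharWidth, usWeightClass, usWidthClass
            }
        }
        return -1;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.out.println("usage: java Os2Reader <font-file>");
            return;
        }
        byte[] font = Files.readAllBytes(Paths.get(args[0]));
        System.out.println("fsType = " + readFsType(font));
    }
}
```

This avoids loading the font through iText just to trigger the exception, at the cost of handling the file format yourself.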
