In Polars, how can one specify a single dtype for all columns in read_csv?
According to the docs, the dtypes argument to read_csv can take either a mapping (dict) in the form of {'column_name': dtype}, or a list of dtypes, one for each column.
However, it is not clear how to specify "I want all columns to be a single dtype".
If you wanted all columns to be pl.Utf8, for example, and you knew the total number of columns, you could do:
pl.read_csv('sample.csv', dtypes=[pl.Utf8]*number_of_columns)
However, this doesn't work if you don't know the total number of columns.
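One possible workaround (a sketch, not from the original post; the file name sample.csv is assumed) is to read just the header first to count the columns. Passing n_rows=0 should parse only the header row:

import polars as pl

# Read zero data rows: only the header is parsed, giving the column names.
header = pl.read_csv('sample.csv', n_rows=0)
df = pl.read_csv('sample.csv', dtypes=[pl.Utf8] * len(header.columns))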
In Pandas, you could do something like:
pd.read_csv('sample.csv', dtype=str)
But this doesn't work in Polars.
Reading all data in a csv as any type other than pl.Utf8 will likely fail with a lot of null values, and we can use expressions to declare how we want to deal with those null values. If you read a csv with infer_schema_length=0, Polars does not infer the schema and reads all columns as pl.Utf8, as that is a supertype of all Polars types. Once everything is read as Utf8, we can use expressions to cast the columns.
import polars as pl

(pl.read_csv("test.csv", infer_schema_length=0)
   .with_columns(pl.all().cast(pl.Int32, strict=False)))
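As a quick illustration (a sketch with made-up data; the column names and values are assumptions, not from the original answer), strict=False turns any value that cannot be parsed into null instead of raising an error:

import io
import polars as pl

# Hypothetical CSV: "age" contains one value that is not a valid integer.
data = io.StringIO("name,age\nalice,30\nbob,n/a\n")

df = (
    pl.read_csv(data, infer_schema_length=0)  # both columns read as Utf8
      .with_columns(pl.all().cast(pl.Int32, strict=False))
)
print(df)
# "name" casts to all nulls, and bob's "age" becomes null:
# values that fail to parse are nulled rather than raising an error.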