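While analyzing data from a somewhat old academic paper, I received data files produced by even older software: SAS version 6, which is more than 20 years old (.ssd01 extension).
How can I convert such a file to a modern format so that it can be analyzed with R, Python, and so on? Bonus (bounty) points if the process does not require a SAS license or other commercial software.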
Hints:
- Reading SAS® data sets with a filename extension such as .ssd01 or .ssd
- Pandas has a read_sas method, but it only works for data from SAS version 7 and later (*.sas7bdat)
- R has similar import functions for .sas7bdat, but they do not work for this version either: Read SAS sas7bdat data into R
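As a minimal sketch of the limitation described in the hints: pandas' read_sas accepts only the sas7bdat and XPORT formats, so it can help to check the extension before attempting a read. The helper name pandas_sas_format and the file name example.ssd01 are hypothetical, chosen only for illustration.

```python
from pathlib import Path

# File extensions mapped to the format names that pandas.read_sas accepts.
PANDAS_FORMATS = {".sas7bdat": "sas7bdat", ".xpt": "xport"}


def pandas_sas_format(filename):
    """Return the pandas.read_sas format name for a file, or None if
    the file would have to be converted with another tool first."""
    return PANDAS_FORMATS.get(Path(filename).suffix.lower())


# "example.ssd01" is a made-up name standing in for a SAS version 6 file.
for name in ("example.sas7bdat", "example.xpt", "example.ssd01"):
    fmt = pandas_sas_format(name)
    print(name, "->", fmt if fmt else "not readable by pandas.read_sas")
```

A None result for a .ssd01 file is the case this question is about: the file has to go through a separate conversion step before pandas or R can load it.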
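Here are the files in case anyone wants to try (no, they are not viruses, just the data from the paper mentioned above):
Best answer
I just downloaded the demo of the 64-bit Windows version of the software from https://www.stattransfer.com and applied it to your first file, asking for CSV output. Here are the first few lines of the result.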
"TIME","PLAYERID","PLAY","PAY","N1","N1PLAY","N1PAY","N2","N2PLAY","N2PAY","N3","N3PLAY","N3PAY","N4","N4PLAY","N4PAY","NN","N5","N5PLAY","N5PAY","N6","N6PLAY","N6PAY","N7","N7PLAY","N7PAY","ACTION","N1ACT","N2ACT","N3ACT","N4ACT","N5ACT","N6ACT","N7ACT","NETWORK","GAME","SESSION","LAGACTON","N1ACTO","N2ACTO","N3ACTO","N4ACTO","N5ACTO","N6ACTO","N7ACTO","ACTS","LAGACTS","PROPORT","LAGPROP","GRAPH","CLUSTER","LENGHT"
2,0,"B",3.2,1,"A",2,2,"A",2,17,"B",3.2,16,"A",2,4,,"",,,"",,,"",,0,1,1,0,1,,,,"local","co","colc1fir",,1,1,0,1,0,0,0,3,,0.75,,"local",0.5,2.647
3,0,"A",0.5,1,"B",2.5,2,"B",2.5,17,"B",2.5,16,"A",0.5,4,,"",,,"",,,"",,1,0,0,0,1,,,,"local","co","colc1fir",0,0,0,0,1,0,0,0,1,3,0.25,0.75,"local",0.5,2.647
4,0,"B",2.5,1,"A",-1,2,"B",1.8,17,"B",3.2,16,"A",0.5,4,,"",,,"",,,"",,0,1,0,0,1,,,,"local","co","colc1fir",1,1,0,0,1,0,0,0,2,1,0.5,0.25,"local",0.5,2.647
5,0,"B",2.5,1,"B",2.5,2,"A",-1,17,"A",0.5,16,"B",3.2,4,,"",,,"",,,"",,0,0,1,1,0,,,,"local","co","colc1fir",0,0,1,1,0,0,0,0,2,2,0.5,0.5,"local",0.5,2.647
6,0,"A",2,1,"B",2.5,2,"A",2,17,"B",3.2,16,"A",3.5,4,,"",,,"",,,"",,1,0,1,0,1,,,,"local","co","colc1fir",0,0,1,0,1,0,0,0,2,2,0.5,0.5,"local",0.5,2.647
7,0,"B",1.8,1,"B",1.8,2,"B",2.5,17,"B",2.5,16,"A",2,4,,"",,,"",,,"",,0,0,0,0,1,,,,"local","co","colc1fir",1,0,0,0,1,0,0,0,1,2,0.25,0.5,"local",0.5,2.647
8,0,"A",2,1,"B",2.5,2,"B",2.5,17,"A",3.5,16,"A",3.5,4,,"",,,"",,,"",,1,0,0,1,1,,,,"local","co","colc1fir",0,0,0,1,1,0,0,0,2,1,0.5,0.25,"local",0.5,2.647
9,0,"B",2.5,1,"B",1.8,2,"B",1.8,17,"A",2,16,"A",3.5,4,,"",,,"",,,"",,0,0,0,1,1,,,,"local","co","colc1fir",1,0,0,1,1,0,0,0,2,2,0.5,0.5,"local",0.5,2.647
10,0,"B",1.8,1,"B",1,2,"B",1.8,17,"B",2.5,16,"A",2,4,,"",,,"",,,"",,0,0,0,0,1,,,,"local","co","colc1fir",0,0,0,0,1,0,0,0,1,2,0.25,0.5,"local",0.5,2.647
11,0,"B",3.2,1,"A",0.5,2,"B",1.8,17,"A",3.5,16,"A",3.5,4,,"",,,"",,,"",,0,1,0,1,1,,,,"local","co","colc1fir",0,1,0,1,1,0,0,0,3,1,0.75,0.25,"local",0.5,2.647
12,0,"B",2.5,1,"A",0.5,2,"B",2.5,17,"B",3.2,16,"A",2,4,,"",,,"",,,"",,0,1,0,0,1,,,,"local","co","colc1fir",0,1,0,0,1,0,0,0,2,3,0.5,0.75,"local",0.5,2.647
13,0,"B",2.5,1,"A",0.5,2,"B",3.2,17,"B",3.2,16,"A",0.5,4,,"",,,"",,,"",,0,1,0,0,1,,,,"local","co","colc1fir",0,1,0,0,1,0,0,0,2,2,0.5,0.5,"local",0.5,2.647
14,0,"B",3.2,1,"A",2,2,"A",3.5,17,"B",3.2,16,"A",2,4,,"",,,"",,,"",,0,1,1,0,1,,,,"local","co","colc1fir",0,1,1,0,1,0,0,0,3,2,0.75,0.5,"local",0.5,2.647
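I have no idea how good this is! :) No, I have no connection with any company that makes or sells this product. And no, I have never tried it before. You now know (almost) as much about it as I do.
Good luck.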
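Once a CSV like the one above exists, it can be loaded with nothing more than Python's standard library. The following is a rough sketch rather than a tested pipeline: the helper names parse_value and load_converted_csv are made up for illustration, and the sample is only the first 20 columns of the first two data rows shown above. It treats empty fields as missing values, which is an assumption about how the converter writes them.

```python
import csv
import io


def parse_value(text):
    """Map an empty or absent field to None, a numeric field to float,
    and leave anything else as a string."""
    if text is None or text == "":
        return None
    try:
        return float(text)
    except ValueError:
        return text


def load_converted_csv(handle):
    """Read CSV rows from an open text handle into a list of dicts."""
    reader = csv.DictReader(handle)
    return [
        {name: parse_value(value) for name, value in row.items()}
        for row in reader
    ]


# First 20 columns of the first two data rows from the converted output.
SAMPLE = (
    '"TIME","PLAYERID","PLAY","PAY","N1","N1PLAY","N1PAY","N2","N2PLAY",'
    '"N2PAY","N3","N3PLAY","N3PAY","N4","N4PLAY","N4PAY","NN","N5",'
    '"N5PLAY","N5PAY"\n'
    '2,0,"B",3.2,1,"A",2,2,"A",2,17,"B",3.2,16,"A",2,4,,"",\n'
    '3,0,"A",0.5,1,"B",2.5,2,"B",2.5,17,"B",2.5,16,"A",0.5,4,,"",\n'
)

rows = load_converted_csv(io.StringIO(SAMPLE))
print(rows[0]["PLAY"], rows[0]["PAY"], rows[0]["N5"])  # B 3.2 None
```

For a real file, the same function can be given a handle from open(path, newline=""), where path is wherever the converted CSV was saved.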
About python - Reading .ssd01 data files from (ancient) SAS version 6 (for Python/R): a similar question was found on Stack Overflow: https://stackoverflow.com/questions/43351594/