python - Sklearn语法错误?

标签 python python-3.x machine-learning scikit-learn sklearn-pandas

我正在通过这个网站(https://pythonprogramming.net/training-testing-machine-learning-tutorial/)学习,第4部分。这是我的代码(复制的,Quandl具有小写q,在新版本中是正确的,出于同样的原因,model_selection而不是cross_validation)。

import quandl, math
import numpy as np
import pandas as pd
from sklearn import preprocessing
from sklearn import model_selection
from sklearn import svm
from sklearn.linear_model import LinearRegression

df = quandl.get("WIKI/GOOGL")
print(df.head())
print(df.tail())
df = df[['Adj. Open',  'Adj. High',  'Adj. Low',  'Adj.                             
             Close', 'Adj. Volume']]

df['HL_PCT'] = (df['Adj. High'] - df['Adj. Low']) /         
df['Adj. Close'] * 100.0
df['PCT_change'] = (df['Adj. Close'] - df['Adj. Open']) /         
    df['Adj. Open'] * 100.0
df = df[['Adj. Close', 'HL_PCT', 'PCT_change', 'Adj. 
   Volume']]
print(df.head())
forecast_col = 'Adj. Close'
df.fillna(value=-99999, inplace=True)
forecast_out = int(math.ceil(0.01 * len(df)))
df['label'] = df[forecast_col].shift(-forecast_out)
df.dropna(inplace=True)
X = np.array(df.drop(['label'], 1))
y = np.array(df['label'])
X = preprocessing.scale(X)
y = np.array(df['label'])
X_train, X_test, y_train, y_test =         
    model_selection.train_test_split(X, y, test_size=0.2)

clf = svm.SVR()
clf.fit(X_train, y_train)
confidence = clf.score(X_test, y_test)
print(confidence)

错误是:

Traceback (most recent call last):
  File "C:/Users/PycharmProjects/learn_python_the_hard_way/LEARN.py", line 4, in <module>
    from sklearn import preprocessing
  File "C:\Users\AppData\Local\Programs\Python\Python36-32\lib\site-packages\sklearn\__init__.py", line 57, in <module>
    from .base import clone
  File "C:\Users\AppData\Local\Programs\Python\Python36-32\lib\site-packages\sklearn\base.py", line 10, in <module>
    from scipy import sparse
  File "C:\Users\PycharmProjects\learn_python_the_hard_way\scipy.py", line 1
    from scip
            ^
SyntaxError: invalid syntax

我不知道如何解决这个问题,任何帮助将不胜感激!

更新: 我创建了 scipy.py 文件,用于在与我现在正在练习的文件相同的文件中进行 scipy 练习,现在将其删除。错误是:

Traceback (most recent call last):
  File "C:/Users/PycharmProjects/learn_python_the_hard_way/LEARN.py", line 4, in <module>
    from sklearn import preprocessing
  File "C:\Users\AppData\Local\Programs\Python\Python36-32\lib\site-packages\sklearn\__init__.py", line 57, in <module>
    from .base import clone
  File "C:\Users\AppData\Local\Programs\Python\Python36-32\lib\site-packages\sklearn\base.py", line 10, in <module>
    from scipy import sparse
  File "C:\Users\AppData\Local\Programs\Python\Python36-32\lib\site-packages\scipy\__init__.py", line 61, in <module>
    from numpy._distributor_init import NUMPY_MKL  # requires numpy+mkl
ImportError: cannot import name 'NUMPY_MKL'

我检查过,numpy 已安装并正在运行!

最佳答案

您有一个名为 scipy.py 的文件(在 C:\Users\PycharmProjects\learn_python_the_hard_way 中),在实际 SciPy 安装之前(可能在您的 Python lib 目录中)找到该文件。您需要重命名它。

关于python - Sklearn语法错误?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/44997163/

相关文章:

python - 在 matplotlib 中添加标签和雷达图之间的间隙

Python单元测试: How to initialize selenium in a class and avoid having the browser opening twice?

python - Sklearn 和 StatsModels 给出了截然不同的逻辑回归答案

python - openai.error.APIConnectionError : Error communicating with OpenAI

Python - 组合类的继承

machine-learning - 提取相关句子到实体

time - 机器学习: optimal parameter values in reasonable time

python-3.x - 检查十进制数的二进制表示是否只存在一个 '1' 的最有效方法

python-3.x - 如何在Elasticsearch中计算每个 token 的总数

machine-learning - Weka 3.8 包管理器不工作