train.py error - Expected 2D array, got 1D array instead #24

@Rhinogradentia

Hi,

I have another question.
I installed the tool via conda with Python 3.7.

I have the following error when running train.py:

(plassClass) /PlasClass$ train.py -p plasmids.fasta -c genome.fasta -o train/ -n 25
Starting PlasClass training
Getting reference lengths
Sampling 96 fragments for length 1000
Getting k-mer frequencies
Learning classifier
Saving classifier
Sampling 9 fragments for length 10000
Getting k-mer frequencies
Learning classifier
Saving classifier
Sampling 0 fragments for length 100000
Getting k-mer frequencies
Learning classifier
Traceback (most recent call last):
  File "/home/<user>/miniconda3_new/envs/plassClass/bin/train.py", line 197, in <module>
    main(args)
  File "/home/<user>/miniconda3_new/envs/plassClass/bin/train.py", line 193, in main
    train(plasfile,chromfile,outdir,num_procs,ks,lens)
  File "/home/<user>/miniconda3_new/envs/plassClass/bin/train.py", line 172, in train
    scaler = StandardScaler().fit(data)
  File "/home/<user>/miniconda3_new/envs/plassClass/lib/python3.7/site-packages/sklearn/preprocessing/data.py", line 639, in fit
    return self.partial_fit(X, y)
  File "/home/<user>/miniconda3_new/envs/plassClass/lib/python3.7/site-packages/sklearn/preprocessing/data.py", line 663, in partial_fit
    force_all_finite='allow-nan')
  File "/home/<user>/miniconda3_new/envs/plassClass/lib/python3.7/site-packages/sklearn/utils/validation.py", line 521, in check_array
    "if it contains a single sample.".format(array))
ValueError: Expected 2D array, got 1D array instead:
array=[].
Reshape your data either using array.reshape(-1, 1) if your data has a single feature or array.reshape(1, -1) if it contains a single sample.

The FASTA files contain NCBI sequences: 4 in the plasmid file and 7 in the genome file. They have no empty lines, and all sequences come from a single species.
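
One thing that could be checked, assuming the empty array is related to the "Sampling 0 fragments for length 100000" line in the log: how many input sequences are at least as long as each fragment length. The sketch below is only an illustration. The helper names `fasta_lengths` and `count_long_enough` are hypothetical and not part of PlasClass, and the way PlasClass actually decides how many fragments to sample may differ.

```python
def fasta_lengths(lines):
    """Return the sequence length of every record in FASTA-formatted lines."""
    lengths = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            lengths.append(0)  # a header line starts a new record
        elif lengths:
            lengths[-1] += len(line)  # sequence line of the current record
    return lengths


def count_long_enough(lengths, fragment_lengths=(1000, 10000, 100000)):
    """Count, per fragment length, the records that are at least that long.

    The default fragment lengths are the ones printed in the training log;
    the comparison itself is an assumption, not PlasClass's sampling rule.
    """
    return {
        frag: sum(1 for n in lengths if n >= frag)
        for frag in fragment_lengths
    }
```

For example, `count_long_enough(fasta_lengths(open("plasmids.fasta")))` would report, for each of the three lengths, how many plasmid records could hold a fragment of that size.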

What might be the reason for this error and what can I do to solve it?

Thank you in advance.
Best,
Nadine
