Used and loved by millions

Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak quote

Justyna Jupowicz-Kozak

CEO of Professional Science Editing for Scientists @ prosciediting.com

MitStanfordHarvardAustralian Nationa UniversityNanyangOxford

data were split

Grammar usage guide and real-world examples

USAGE SUMMARY

The phrase "data were split" is correct and usable in written English.
It can be used in contexts involving data analysis, research, or statistics, where data sets are divided into different parts for analysis or processing. Example: "In the study, the data were split into training and testing sets to evaluate the model's performance."

✓ Grammatically correct

Science

Human-verified examples from authoritative sources

Exact Expressions

60 human-written examples

Parent data were split into lower and higher BAP groups.

The data were split into three categories: training of the algorithms (685 patients), validation (172 patients) and test (150 patients).

This was a "three color" image of the Crab Nebula, where the X-ray data were split into three different energy bands.

To do this, data were split randomly into a training and a test sets, then the model was trained with the training sample and its performance was assessed using the test sample.

Furthermore, the data were split into two equal-sized training and test subsets by the Kennard-Stone design and the errors of calibration (RMSEC) and prediction (RMSEP) were calculated.

The data were split randomly into train and test subsets.

The available Pfizer data were split into two categories: IC50 and percent inhibition data.

The data were split into two categories: 1) remarks on the items and 2) remarks on the questionnaire in general.

The corpus data were split into 64 MB blocks (Hadoop default), and loaded into the Hadoop Distributed Filesystem (HDFS).

Data were split into two time frames of 30 min each, and images were reconstructed as described above.

However, very often the discontinuities are not known or cannot be quantified and in these cases the observatory data were split into two or more series.

Show more...

Expert writing Tips

Best practice

When describing data splitting, specify the criteria used for splitting (e.g., randomly, by date, by category) to provide clarity.

Common error

Avoid vague statements like "The data were split". Instead, provide details such as the purpose of splitting the data, the method used, and the resulting subgroups.

Antonio Rotolo, PhD - Digital Humanist | Computational Linguist | CEO @Ludwig.guru

Antonio Rotolo, PhD

Digital Humanist | Computational Linguist | CEO @Ludwig.guru

Source & Trust

86%

Authority and reliability

4.5/5

Expert rating

Real-world application tested

Linguistic Context

The phrase "data were split" functions as a passive construction describing an action performed on the data. As Ludwig AI explains, it's commonly used in data analysis to indicate that a dataset has been divided for specific purposes. The examples in Ludwig demonstrate its use in various scientific contexts.

Expression frequency: Very common

Frequent in

Science

98%

Academia

2%

Formal & Business

0%

Less common in

News & Media

0%

Encyclopedias

0%

Wiki

0%

Ludwig's WRAP-UP

In summary, "data were split" is a grammatically sound and widely used phrase, according to Ludwig AI, especially within scientific and academic writing to describe data analysis methodologies. It signifies that a dataset has been divided for specific purposes, such as training and testing models. While the phrase is correct, providing details about the how and why is crucial for clarity. Alternatives like "data were divided" or "data were partitioned" can be used depending on the context. The phrase maintains a formal and scientific register, primarily appearing in research papers and technical documentation. It is a standard term in scientific methodology.

FAQs

How can I use "data were split" in a sentence?

You can use "data were split" to describe how a dataset was divided for analysis, such as "The data were split into training and testing sets" or "The data were split by demographic categories".

What's a good alternative to "data were split"?

Alternatives include "data were divided", "data were partitioned", or "data were segmented". The best choice depends on the specific context and the nature of the division.

Is it better to say "data was split" or "data were split"?

Since "data" is generally treated as a plural noun in scientific and technical contexts, "data were split" is the more grammatically correct and widely accepted option. However, in informal contexts, "data was split" might be encountered.

What does it mean when someone says, "the data were split into training and test sets"?

It means the original dataset was divided into two subsets: a training set used to build a model and a test set used to evaluate its performance. This process is common in machine learning and statistical modeling.

ChatGPT power + Grammarly precisionChatGPT power + Grammarly precision
ChatGPT + Grammarly

Editing plus AI, all in one place.

Stop switching between tools. Your AI writing partner for everything—polishing proposals, crafting emails, finding the right tone.

Source & Trust

86%

Authority and reliability

4.5/5

Expert rating

Real-world application tested

Most frequent sentences: