Used and loved by millions
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.
Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com
data were split
Grammar usage guide and real-world examplesUSAGE SUMMARY
The phrase "data were split" is correct and usable in written English.
It can be used in contexts involving data analysis, research, or statistics, where data sets are divided into different parts for analysis or processing. Example: "In the study, the data were split into training and testing sets to evaluate the model's performance."
✓ Grammatically correct
Science
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Human-verified examples from authoritative sources
Exact Expressions
60 human-written examples
Parent data were split into lower and higher BAP groups.
The data were split into three categories: training of the algorithms (685 patients), validation (172 patients) and test (150 patients).
Academia
This was a "three color" image of the Crab Nebula, where the X-ray data were split into three different energy bands.
Academia
To do this, data were split randomly into a training and a test sets, then the model was trained with the training sample and its performance was assessed using the test sample.
Science
Furthermore, the data were split into two equal-sized training and test subsets by the Kennard-Stone design and the errors of calibration (RMSEC) and prediction (RMSEP) were calculated.
The data were split randomly into train and test subsets.
The available Pfizer data were split into two categories: IC50 and percent inhibition data.
Science
The data were split into two categories: 1) remarks on the items and 2) remarks on the questionnaire in general.
The corpus data were split into 64 MB blocks (Hadoop default), and loaded into the Hadoop Distributed Filesystem (HDFS).
Data were split into two time frames of 30 min each, and images were reconstructed as described above.
Science
However, very often the discontinuities are not known or cannot be quantified and in these cases the observatory data were split into two or more series.
Science
Expert writing Tips
Best practice
When describing data splitting, specify the criteria used for splitting (e.g., randomly, by date, by category) to provide clarity.
Common error
Avoid vague statements like "The data were split". Instead, provide details such as the purpose of splitting the data, the method used, and the resulting subgroups.
Source & Trust
86%
Authority and reliability
4.5/5
Expert rating
Real-world application tested
Linguistic Context
The phrase "data were split" functions as a passive construction describing an action performed on the data. As Ludwig AI explains, it's commonly used in data analysis to indicate that a dataset has been divided for specific purposes. The examples in Ludwig demonstrate its use in various scientific contexts.
Frequent in
Science
98%
Academia
2%
Formal & Business
0%
Less common in
News & Media
0%
Encyclopedias
0%
Wiki
0%
Ludwig's WRAP-UP
In summary, "data were split" is a grammatically sound and widely used phrase, according to Ludwig AI, especially within scientific and academic writing to describe data analysis methodologies. It signifies that a dataset has been divided for specific purposes, such as training and testing models. While the phrase is correct, providing details about the how and why is crucial for clarity. Alternatives like "data were divided" or "data were partitioned" can be used depending on the context. The phrase maintains a formal and scientific register, primarily appearing in research papers and technical documentation. It is a standard term in scientific methodology.
More alternative expressions(6)
Phrases that express similar concepts, ordered by semantic similarity:
data were divided
Replaces "split" with the synonym "divided", maintaining the passive voice and past tense, but altering the specific verb.
data were separated
Indicates a more general division, without necessarily implying a structured methodology.
data were partitioned
Employs "partitioned" to suggest a more structured or planned division of data, carrying a slightly more formal connotation.
data were segmented
Uses "segmented" to highlight that the data was divided in order to create individual subsets.
data were subsetted
Highlights process of creating subsets from the larger dataset.
data were allocated
Suggests a division with a specific purpose of distribution, often for training or testing.
data were categorized
Emphasizes the classification aspect of splitting data, focusing on grouping data into distinct categories.
data were stratified
Implies a division based on specific layers or levels within the data.
data were grouped
Highlights data clustering based on shared characteristics.
data were binned
Specifies that the data was divided into bins, likely by quantitative ranges.
FAQs
How can I use "data were split" in a sentence?
You can use "data were split" to describe how a dataset was divided for analysis, such as "The data were split into training and testing sets" or "The data were split by demographic categories".
What's a good alternative to "data were split"?
Alternatives include "data were divided", "data were partitioned", or "data were segmented". The best choice depends on the specific context and the nature of the division.
Is it better to say "data was split" or "data were split"?
Since "data" is generally treated as a plural noun in scientific and technical contexts, "data were split" is the more grammatically correct and widely accepted option. However, in informal contexts, "data was split" might be encountered.
What does it mean when someone says, "the data were split into training and test sets"?
It means the original dataset was divided into two subsets: a training set used to build a model and a test set used to evaluate its performance. This process is common in machine learning and statistical modeling.
Editing plus AI, all in one place.
Stop switching between tools. Your AI writing partner for everything—polishing proposals, crafting emails, finding the right tone.
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Source & Trust
86%
Authority and reliability
4.5/5
Expert rating
Real-world application tested