Used and loved by millions
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.
Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com
data contain
Grammar usage guide and real-world examplesUSAGE SUMMARY
The phrase "data contain" is not correct in standard written English.
It should be used in a context where "data" is treated as a plural noun, typically followed by a plural verb. Example: "The data contain valuable insights that can help us improve our strategy."
⚠ May contain grammatical issues
Science
News & Media
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Human-verified examples from authoritative sources
Exact Expressions
60 human-written examples
Because newly released data contain the first faint hope that, not only is the economy in New York City not getting worse, it may actually be getting better.
News & Media
The XML data contain hierarchy of heterogeneous CAD assemblies.
Science
The ISMU data contain information on social and economic characteristics.
Science
Most learning algorithms work most effectively when their training data contain completely specified labeled samples.
Science
These data contain clues to changes of glycosyl transferase activity that accompany speciation.
Science
Voxel data describe structural information, whereas SDF data contain details on 3D shapes.
Science
MEG and EEG data contain additive correlated noise generated by environmental and physiological sources.
Science
The functional data contain high dimensionality, high feature correlation, non-stationality, and large amount of noise.
Communications records, often referred to as meta data, contain information about a communication event, but not its content.
News & Media
Network data contain configurations, signal strength, traffic load, and interference information.
These actual logistics data contain information about the delivery status of required material.
Science
Expert writing Tips
Best practice
Prefer using alternatives like "data include", "data encompass", or "data hold" for better grammatical accuracy and clarity in formal writing. When referring to "data", treat it as a plural noun.
Common error
Avoid using "data contain" as it implies singular agreement with a plural noun. Always ensure verb agreement by using "data include" or structuring the sentence to use "dataset contains".
Source & Trust
84%
Authority and reliability
3.8/5
Expert rating
Real-world application tested
Linguistic Context
The phrase "data contain" functions as a predicate in sentences, aiming to express that data encompasses, includes, or holds specific information. While widely used, it presents grammatical issues due to subject-verb disagreement, as highlighted by Ludwig AI.
Frequent in
Science
77%
News & Media
15%
Formal & Business
8%
Less common in
Academia
0%
Encyclopedias
0%
Wiki
0%
Ludwig's WRAP-UP
The phrase "data contain" is frequently used to indicate that data encompasses specific information. However, Ludwig AI points out that this construction isn't grammatically correct. While prevalent, especially in scientific and news contexts, it's advisable to use alternatives like "data include", "data encompass", or "data hold" for clearer and more accurate writing. When referring to datasets, consider restructuring sentences to use "dataset contains" to ensure subject-verb agreement. Prioritizing grammatical precision enhances credibility and clarity, particularly in formal communications.
More alternative expressions(10)
Phrases that express similar concepts, ordered by semantic similarity:
the dataset contains
Uses "dataset" as singular and corrects the verb conjugation to "contains."
data include
Replaces "contain" with "include", a more grammatically sound option in standard English.
data hold
A more direct substitution for "contain", indicating the data possesses certain information.
data encompass
Uses "encompass" for a broader, more comprehensive sense of data inclusion.
data show
A simple and direct way to express that data demonstrates something.
data consist of
Employs "consist of" to describe the specific elements that the data comprises.
data comprise
Similar to "consist of", but with "comprise" indicating what the data is made up of.
data incorporate
Suggests that the data integrates or combines various elements.
data present
Indicates that the data offers or showcases specific details.
data reveal
Highlights that the data uncovers or discloses certain aspects.
FAQs
How can I use "data contain" correctly in a sentence?
While "data contain" is commonly used, it's grammatically safer to use "data include" or restructure the sentence to use "dataset contains".
What are some alternatives to "data contain"?
Alternatives include phrases like "data include", "data encompass", or "data hold", depending on the intended meaning.
Is it better to say "data contain" or "data contains"?
Neither is ideal. Although "data" is plural, "data contains" is not grammatically correct because the subject, 'data', is already plural. Consider using "dataset contains" if referring to a single dataset.
What's the difference between "data contain" and "data includes"?
"Data includes" is grammatically correct because 'includes' agrees with the plural noun 'data'. Avoid using "data contain" in formal writing.
Editing plus AI, all in one place.
Stop switching between tools. Your AI writing partner for everything—polishing proposals, crafting emails, finding the right tone.
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Source & Trust
84%
Authority and reliability
3.8/5
Expert rating
Real-world application tested