Used and loved by millions

Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak quote

Justyna Jupowicz-Kozak

CEO of Professional Science Editing for Scientists @ prosciediting.com

MitStanfordHarvardAustralian Nationa UniversityNanyangOxford

data contain

Grammar usage guide and real-world examples

USAGE SUMMARY

The phrase "data contain" is not correct in standard written English.
It should be used in a context where "data" is treated as a plural noun, typically followed by a plural verb. Example: "The data contain valuable insights that can help us improve our strategy."

⚠ May contain grammatical issues

Science

News & Media

Human-verified examples from authoritative sources

Exact Expressions

60 human-written examples

Because newly released data contain the first faint hope that, not only is the economy in New York City not getting worse, it may actually be getting better.

News & Media

The New York Times

The XML data contain hierarchy of heterogeneous CAD assemblies.

The ISMU data contain information on social and economic characteristics.

Most learning algorithms work most effectively when their training data contain completely specified labeled samples.

These data contain clues to changes of glycosyl transferase activity that accompany speciation.

Science

Placenta

Voxel data describe structural information, whereas SDF data contain details on 3D shapes.

MEG and EEG data contain additive correlated noise generated by environmental and physiological sources.

Science

NeuroImage

The functional data contain high dimensionality, high feature correlation, non-stationality, and large amount of noise.

Communications records, often referred to as meta data, contain information about a communication event, but not its content.

News & Media

TechCrunch

Network data contain configurations, signal strength, traffic load, and interference information.

These actual logistics data contain information about the delivery status of required material.

Show more...

Expert writing Tips

Best practice

Prefer using alternatives like "data include", "data encompass", or "data hold" for better grammatical accuracy and clarity in formal writing. When referring to "data", treat it as a plural noun.

Common error

Avoid using "data contain" as it implies singular agreement with a plural noun. Always ensure verb agreement by using "data include" or structuring the sentence to use "dataset contains".

Antonio Rotolo, PhD - Digital Humanist | Computational Linguist | CEO @Ludwig.guru

Antonio Rotolo, PhD

Digital Humanist | Computational Linguist | CEO @Ludwig.guru

Source & Trust

84%

Authority and reliability

3.8/5

Expert rating

Real-world application tested

Linguistic Context

The phrase "data contain" functions as a predicate in sentences, aiming to express that data encompasses, includes, or holds specific information. While widely used, it presents grammatical issues due to subject-verb disagreement, as highlighted by Ludwig AI.

Expression frequency: Very common

Frequent in

Science

77%

News & Media

15%

Formal & Business

8%

Less common in

Academia

0%

Encyclopedias

0%

Wiki

0%

Ludwig's WRAP-UP

The phrase "data contain" is frequently used to indicate that data encompasses specific information. However, Ludwig AI points out that this construction isn't grammatically correct. While prevalent, especially in scientific and news contexts, it's advisable to use alternatives like "data include", "data encompass", or "data hold" for clearer and more accurate writing. When referring to datasets, consider restructuring sentences to use "dataset contains" to ensure subject-verb agreement. Prioritizing grammatical precision enhances credibility and clarity, particularly in formal communications.

FAQs

How can I use "data contain" correctly in a sentence?

While "data contain" is commonly used, it's grammatically safer to use "data include" or restructure the sentence to use "dataset contains".

What are some alternatives to "data contain"?

Alternatives include phrases like "data include", "data encompass", or "data hold", depending on the intended meaning.

Is it better to say "data contain" or "data contains"?

Neither is ideal. Although "data" is plural, "data contains" is not grammatically correct because the subject, 'data', is already plural. Consider using "dataset contains" if referring to a single dataset.

What's the difference between "data contain" and "data includes"?

"Data includes" is grammatically correct because 'includes' agrees with the plural noun 'data'. Avoid using "data contain" in formal writing.

ChatGPT power + Grammarly precisionChatGPT power + Grammarly precision
ChatGPT + Grammarly

Editing plus AI, all in one place.

Stop switching between tools. Your AI writing partner for everything—polishing proposals, crafting emails, finding the right tone.

Source & Trust

84%

Authority and reliability

3.8/5

Expert rating

Real-world application tested

Most frequent sentences: