Used and loved by millions
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com
data comprise
Grammar usage guide and real-world examplesUSAGE SUMMARY
The phrase "data comprise" is correct and usable in written English.
You would typically use this phrase to refer to the components that make up a set of data. For example, "The data comprise sales figures from the past month."
✓ Grammatically correct
Science
News & Media
Alternative expressions(5)
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Human-verified examples from authoritative sources
Exact Expressions
60 human-written examples
Unfortunately, many data models in the Web of Data comprise very few or no constraints at all, so relying on constraints to generate schema mappings is not appealing.
Univariate data comprise only patients eligible for multivariable analysis.
Science
All the data comprise multiple heterogeneous data repositories.
The recorded data comprise non-line-of-sight, obstructed line-of-sight, and line-of-sight conditions.
The data comprise almost 3 million data points, covering over 20,000 unique players and more than 650 products.
Science
This is expected, since EPIC data comprise a much easier domain than the MAVIR data for ASR, and hence from an STD perspective as well.
The empirical data comprise total number of dailyd deaths caused by the Ebola virus as calculated by "OMS" of four African countries.
Science
Our data comprise a set of messages publicly exchanged through http://www.twitter.com from the 1st of March, 2011, to the 31st of March , 2012
Science
The experimental data comprise a series of 12 tests in pure torsion and an additional database of experimental information for 24 specimens compiled from works around the world.
Science
Not only does the data comprise drug-protein, protein-protein, protein-metabolite and metabolite-disease interactions, but they're including data that's essentially never been analyzed: "We're looking at metabolites that no one has looked at before".
News & Media
FTC data reflect acquisitions reported by Moody's Investors Service and Standard & Poor's Corp. Post-1973 data comprise all merger announcements, including those not actually consummated, and are therefore not strictly comparable with the FTC's figures.
News & Media
Expert writing Tips
Best practice
When using "data comprise", ensure the subject is plural (data) as it dictates the plural verb form. Remember that "data" is the plural form of "datum".
Common error
Avoid using a singular verb form with "data". It's incorrect to say "the data comprises"; instead, use "the data comprise".
Source & Trust
81%
Authority and reliability
4.5/5
Expert rating
Real-world application tested
Linguistic Context
The phrase "data comprise" functions as a declarative statement, asserting what elements make up a set of data. As supported by Ludwig, it's a grammatically sound way to express composition.
Frequent in
Science
85%
News & Media
10%
Formal & Business
5%
Less common in
Academia
0%
Encyclopedias
0%
Wiki
0%
Ludwig's WRAP-UP
In summary, "data comprise" is a grammatically correct and frequently used phrase to indicate the elements that constitute a dataset. Ludwig AI confirms its validity. While it's most common in scientific contexts, it's also appropriate for news and media, and formal business writing. Remember that "data" is plural, so the verb must also be plural. Avoid the common mistake of using a singular verb form like "comprises". Alternative phrases such as "data include" or "data consist of" can be used for variety.
More alternative expressions(6)
Phrases that express similar concepts, ordered by semantic similarity:
data encompass
This alternative suggests a broader inclusion of elements within the data.
data include
Focuses on listing specific items contained within the data.
data consist of
Emphasizes the components that make up the data in its entirety.
data are composed of
Similar to "data consist of", highlighting the constituent parts.
data constitute
This is a more formal way of saying what the data are made up of.
data embody
Suggests that the data are a tangible representation of something.
data incorporate
Indicates the data brings elements together into a unified whole.
data contain
Highlights the data's capacity to hold various elements.
data embrace
Implies a comprehensive and accepting inclusion of different elements.
data cover
This term indicates the scope or range of the data.
FAQs
How to use "data comprise" in a sentence?
Use "data comprise" to indicate what elements constitute a dataset. For example, "The sales data comprise figures from the last quarter".
What can I say instead of "data comprise"?
You can use alternatives like "data include", "data consist of", or "data encompass" depending on the context.
Which is correct, "data comprise" or "data comprises"?
"Data comprise" is correct. "Data" is a plural noun, so it requires the plural verb form "comprise". The phrase "data comprises" is grammatically incorrect.
What's the difference between "data comprise" and "data consist of"?
While both phrases are similar, "data comprise" directly states the components, whereas "data consist of" emphasizes that the data are made up of those components. The phrase "data consist of" is used to describe the essence of the components.
Editing plus AI, all in one place.
Stop switching between tools. Your AI writing partner for everything—polishing proposals, crafting emails, finding the right tone.
Table of contents
Usage summary
Human-verified examples
Expert writing tips
Linguistic context
Ludwig's wrap-up
Alternative expressions
FAQs
Source & Trust
81%
Authority and reliability
4.5/5
Expert rating
Real-world application tested