This analysis was prepared by Dr. Brett Baillie and Dr. Marie Connett Porceddu using patent application and patent data provided by Neil Bacon and Doug Ashton from within a larger project funded by the OS4 Initiative (link) . We are grateful for production assistance on many of the pages here by Dr. Kerry Fluhr, Dr. Wei Yang , Dr. Nick dos Remedios, and Nik Hatta.
This landscape deals with the Intellectual property and IP issues surrounding the genome of Arabidopsis thaliana. The landscape is intended to provide insights into the deliverability of innovations in the plant sciences for the public good. To this end the landscape investigates patenting activity directed towards the nucleotide and gene sequences from Arabidopsis - with a particular focus on patent applications involving large groups of DNA sequences. The results of the analysis will be of interest to those involved in public sector research and innovation who are unaware of the ways in which the fruits of public-funded research can be sequestered by private for-profit organisations.
Patent applications that claim Arabidopsis genes have the potential to also claim related genes from other organisms. Hence granted patents on genes in Arabidopsis may effectively grant exclusionary rights to similar genes from important industrial and subsistence crops such as canola, soya, potato, cassava, cotton, poplar and Eucalyptus.
Understanding the landscape of patents on Arabidopsis sequences also allows a better understanding of the degree to which these overlapping patent rights may influence future research and innovation. If findings about the function of Arabidopsis genes provide stronger enablement for composition of matter claims already filed, research done in the public sector may not actually be in the public domain or available for public good uses!
This landscape will provide an overview of some important aspects of the intellectual property claims covering the Arabidopsis genome, which many in the biological sciences consider to be a public good, a model system that enables critical insights into many of the important pathways of crops, weeds and other plants. In particular, we investigated "bulk sequence claims", patent applications that attempt to claim 100 or more Arabidopsis sequences, and found a large number of such patent applications by Monsanto, Syngenta, Dow, Dupont, Bayer, and BASF and their subsidiaries and researchers who work with these companies under exclusive contracts.
This landscape is not intended to provide a legal opinion about freedom to operate using Arabidopsis sequences. The nature of patenting systems worldwide means that new patents and patent applications may appear at any time. Similarly, patents may lapse or patent applications may expire or be replaced by new applications. During the course of preparation of this landscape in early 2007, a series of new publications appeared, providing information for the first time on applications encumbering thousands of sequences, filed up to seven years earlier. This provided a firm reminder that it is not possible to take for granted that if a topic is not readily found in a search of published patents and patent applications, there is freedom to operate! So, although we have tried to give the best coverage of the intellectual property surrounding Arabidopsis sequences, this landscape should not be viewed as a comprehensive coverage of the subject.
Below are some important patent law concepts that are important for understanding the Rice Genome Landscape:
Arabidopsis thaliana is a model plant used in research to study many aspects of plant biology. Originating in Europe and Central Asia, it is a dicot, as are many important and culturally significant staple crops, such as potato; commercially important food crops, such as soybean; and fiber crops, such as cotton and hardwood trees. Unlike many other members of the family Brassicaceae (the same family contains canola, broccoli, cabbage, mustard and other food and oilseed crop species), Arabidopsis thaliana is not grown for food or oil production, and is not agronomically important in itself. Plants are quite small (see Figure 1 below).
Although in the genus Arabidopsis there are approximately 9 species and 8 sub-species (see the entry for Arabidopsis in Wikipedia), it is not uncommon for plant biology researchers to use "Arabidopsis" to mean Arabidopsis thaliana, the convention used in this landscape.
![]()
Figure 1: Arabidopsis thaliana seedlings growing in tissue culture (seedling ~10mm in width)
The popularity of as a model for all plant processes has given rise to numerous methods and protocols for its use in research. Because germination through to senescence is only approximately 50 days (The Arabidopsis Information Resource (TAIR)), Arabidopsis offers a fast system in which to study processes that may take months or years in other flowering plants. The availability of large numbers of mutants and T-DNA insertion lines (e.g. The European Arabidopsis Stock Centre, NASC) has made it useful in understanding the role of many plant genes, particularly those involved in development, metabolism, and disease resistance.
Arabidopsis was chosen as the first plant species for a public whole genome sequencing effort because of its small genome size. The table below shows the genome sizes of some commonly-studied organisms, including Arabidopsis.
Organism
Common Name
Genome Size
Escherichia coli K12 E.coli 4.64 Mb
Saccharomyces cerevisiae YJM789 baker's yeast
16 Mb
Arabidopsis thaliana thale cress
120 Mb
Oryza sativa rice
430 Mb
Lycopersicon esculentum
tomato
950 Mb
Zea mays maize
2365 Mb
Homo sapiens man
3038 Mb
Triticum aestivum wheat
17000 Mb
(Genome sizes were taken from the NCBI's Genomic Biology web pages.)
The availability of sequence data for the Arabidopsis genome and EST projects has ensured that Arabidopsis is now more widely used than ever as a model for the study of flowering plants, and new or improved enabling technologies are being applied to it as they become available for research. For example, with the availability of genomic sequence and EST data, gene silencing or RNAi technology is being used as a research method to determine the function of Arabidopsis genes. The Arabidopsis Genomic RNAi Knock-out Line Analysis (AGRICOLA) project was started in 2002 with the goal of providing the scientific community with research tools (specific vectors and cloned Arabidopsis DNA) necessary to study gene function via gene silencing.
Understanding the patent landscape surrounding Arabidopsis is important in planning research that aims for application in products, because of the continued importance of Arabidopsis as a research tool to many thousands of scientists. Furthermore, use of the genomic information of Arabidopsis in patents and patent applications has resulted in broad claims for patent coverage of genes in other flowering plants.
Khush GS (1997) Origin, dispersal, cultivation and variation of rice. Plant molecular biology 35 (1-2), 25-34 (Sep 1997)
See background information about the structure of chromosomes. The genome of Arabidopsis thaliana consists of 5 chromosomes, one circular mitochondrial DNA, and a circular chloroplast DNA. Chromosomes are often drawn as in Figure 1 below: with a short and long arm and an intervening centromeric region. Figure 2 depicts the shapes and relative sizes of the A. thaliana chromosomes. Table 1 shows the size (in Mbp) and predicted number of genes for each chromosome.
|
|
![]() |
|
Figure 1 |
Figure 2 |
Table 1
|
Chromosome |
Size |
Predicted Genes |
|---|---|---|
|
Chr1 |
30.4 |
7863 |
|
Chr2 |
19.7 |
5115 |
|
Chr3 |
23.5 |
6255 |
|
Chr4 |
18.6 |
4811 |
|
Chr5 |
27.0 |
7084 |
|
Mitochondria |
0.366 |
135 |
|
Chloroplast |
0.154 |
122 |
|
Total |
~120Mb |
31,385 |
The Arabidopsis Genome Initiative (AGI) was set up in 1996. Consisting of 6 groups, it's mission was to sequence the genome of Arabidopsis thaliana by 2004 at high accuracy. Various aspects of the sequencing mission were divided between the 6 groups:
|
|
Links:
Stanford
University
|
The sequencing project was completed in 2000, four years ahead of schedule.
The sequence derived from the AGI's efforts were the subject of a number of publications in nature, including:
Before, during, and subsequent to the AGI's efforts, groups, including some of those involved in the genomic sequencing, were involved in sequencing Expressed Sequence Tags (ESTs) from Arabidopsis. The following companies and institutes were involved in these efforts:
Today there are more than 622,000 Arabidopsis ESTs in the dbEST database at the National Center for Biotechnology Information (NCBI).
Analysis of the recent history of patent documents involving Arabidopsis provides some interesting information regarding the evolution of present day patenting strategies for organisms whose genomes are available. It also offers the opportunity to investigate Arabidopsis as a "model" for future patenting of other plant genomes.
The following simple search was performed using PatentLens to search the term "Arabidopsis" in full text patent documents (detailed search parameters here). Although it must be emphasized that this search is likely to overestimate the number of patent documents making claims to Arabidopsis "inventions" (it will also uncover documents making reference to Arabidopsis), its main purpose should be seen as a rough measure of the impact of Arabidopsis on patent publications.
|
|
|
Figure 1: Patent documents containing the term "Arabidopsis" in the full text
|
|
|
|
Figure 2: Interest in Arabidopsis as gauged from comparisons of patent applications, and journal and sequence publications Although the number of patent filings of documents containing the term "Arabidopsis" fell in 2004 and 2005, the number of deposited Arabidopsis sequences in GenBank remains high (although there is a large variation in the number of sequences deposited from year to year). Additionally, "Arabidopsis" appears as a search term in an ever increasing number of PubMed entries at NCBI. One interpretation of the sequence and publication trends is that interest in Arabidopsis as a model organism remains high, and possibly increases with time. However, this later interest (after 2003) has not translated into an increase in patent applications. Such observations are consistent with what may be expected for a mature technology. (Again, please note that PubMed entries contain the word "Arabidopsis" in the text, they need not necessarily be publications directed towards Arabidopsis. This is similar to the case of full text searches using PatentLens. Hence we argue that the trends in this data are comparable, providing that the user understands the limitations of the datasets and searches.) |
One of the caveats with the analysis above is that it does not focus on patents making claim to sequences from Arabidopsis. Such an analysis is difficult due to the fact that more than one sequence can be claimed in a single patent or patent application. Hence a plot of the number of Arabidopsis sequences in claims against the year of application may look very different from a plot of patent documents containing claimed Arabidopsis sequences versus year of application.
There are many ways to define the limits of a claim towards related genes, nucleotides, or protein sequences. One way is through a direct comparison of two aligned sequences in order to determine if they are "similar" enough to each other so that the claim language is satisfied. Typically this requires a quantifiable comparison, often in the form of "% identity":
US 2005/50583 (assigned to Genesis/Agrigenesis, according to press releases now subject to the rights of ArborGen) Claim 3
An isolated polynucleotide comprising a sequence selected from the group consisting of:
(a) sequences having at least 75% identity to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
In this case if a compared sequence (even a sequence not in the possession of the patentee, such as one discovered in another species) falls within the 75% sequence identity range, it could potentially be subject to the patent applicant's claims. Alternatively, subject to a number of detailed caveats (for the US see 35 USC 102), if a compared sequence more than 75% identical was made public or submitted for patent by someone else prior to the priority date of the patent application ("prior art"), the patent claim may be invalid.
Such a comparison is technically straightforward, although for a long sequence listing with more than a handful of sequences (almost 500 are claimed here) it would require a large amount of time for the examiner to do a thorough analysis to determine whether or not the claim is valid, and for a researcher or a member of the public to determine whether any particular sequence might be infringing.
Even more difficult for the patent examiner and the public to compare, to clearly define the bounds of what is subject to patent claims, is the use of such claim language as:
US 2005/91708 (assigned to Dow Chemical) Claim 1
An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-7554 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency, wherein expression of said isolated nucleic acid in a plant results in an altered metabolic characteristic.
Hybridisation experiments would seem to be a definitive way to define similarity in patent documents. Unfortunately, the tools to perform fast and accurate comparisons of sequences based on such language, without the use of a research laboratory, are lacking in the patent office, and what is more, a skilled practitioner will realise that many (possibly only distantly related!) nucleic acid sequences will hybridise together under low stringency conditions. At some frequency, any DNA at all will hybridise to the claimed sequence under any stringency conditions.
Hence hybridisation language, when used in patent documents, has the potential to create uncertainty in understanding the scope of the claim. The probability that an applicant will achieve such claims to all 7554 sequences is small. However, it is possible not only that some of these sequences will be successfully claimed in issued patents, but that most remain pending in future applications. The uncertainty with this claim is two-fold:
- Which sequences are going to be successfully claimed in future?
- Will the applicant also acheive claims to related sequences? On the face of it, this claim would apply to sequences from any other species, not necessarily a plant species, and not necessarily sequences in the possession of the patentee, that hybridise to any of the 7554 claimed sequences and alter anything about a plant when expressed in the plant.
Below we explain, for those unfamiliar with the concept, how hybridisation works and how it is often presented in a patent application or patent.
DNA is a biological polymer consisting of "simple" repeated subunits, or bases. It is within the sequence of these bases that genetic information is stored. Although DNA can exist as a single strand in solution, it is generally found within living systems together with its complementary strand (see the figure below). If we consider a simple representational diagram of a DNA molecule, consisting of a sequence of bases: A, G, C, and T. We can see that some of the bases interact with one another in the figure below.
The bases A,T,G, and C form hydrogen bonds with their respective bases T, A, C and G, in the complementary strand. These bonds provide a stability to the double strand DNA (dsDNA), under normal circumstances, preventing it from dissociating into single strand DNA (ssDNA). Significantly, dsDNA can be "melted" to produce ssDNA under certain physical circumstances and can be influenced by many factors such as temperature, salt concentration, inorganic solvents, sequence composition, and (importantly) the degree of sequence similarity or missmatch between ssDNA molecules.
The Concept of Melting Temperature (Tm)
Mathematical relationships between the various physical parameters listed above and the melting point of DNA have been established. Historically, the point at which 50% of the DNA molecule is ssDNA is referred to as the melting temperature or Tm.
The Tm for a DNA sequence can be estimated via the commonly-used calculation:
Where Tm = melting temperature in oC
[Na+] = Molar concentration of sodium ions in
%[G+C] = percent of G+C bases in DNA sequence
n = length of DNA sequence in bases
P = temperature correction for % mismatched base pairs (~1oC per 1% mismatch)
F = correction for formamide concentration (= 0.63oC per 1% [formamide])Note: A particular DNA sequence will have different Tm values under different conditions. This calculation is sometimes referred to as the effective Tm.
The fact that ssDNA can re-anneal to form dsDNA is used in the process called hybridisation.
NOTE that the Tm refers to 50% hybridisation, not complete hybridisation!
A typical hybridisation experiment consists of a number of steps:
- Binding of target ssDNA to a solid support (often a thin nitrocellulose, nylon, or polymer membrane), referred to here as a "blot".
- Blocking any remaining free binding sites for DNA on the blot.
- Labeling of the "query" ssDNA or probe.
- Creation of a hybridisation solution containing the probe at a known concentration of salt, formamide,...
- Equilibrating the "blot" with the hybridisation solution at a defined temperature (for a specified length of time).
- Washing the unhybridised probe off the blot using a specific wash protocol; the wash conditions are actually more important than the hybridisation conditions for determining what will remain hybridised, but are less often specified in patent applications!
- Detecting the bound probe by some means.
If the probe (or query DNA) is similar enough to the target ssDNA (on the blot) then the probe will remain bound to it after the wash step.
"Stringency" refers to the concentrations and temperatures in steps 4-6 above, and it is most often "strigency" that is referred to in order to determine the scope of claims. Strictly speaking, this should refer to the "stringency" of the wash step required to leave the probe bound to target sequences covered by the claims: the "query" is often a sequence mentioned in the claims section (as in the example given above).
Although the table below is not legally binding and indeed patent applications may make their own definitions of stringency levels, the typical usage is roughly:
Stringency
[NaCl] (Molar)
Range from Tm (oC)
Description
High
0.0165 - 0.0330
5 to 10oC below Tm
Only highly similar sequences will bind under these conditions (typically >95%)
Medium/Moderate
0.165 - 0.330
20 to 29oC below Tm
Those sequences above and less homologous ones (typically >80%)
Low
0.330 - 0.825
40 to 48oC below Tm
Sequences with lower homology will bind under these conditions (>50%?)
1. Note that the values here are for the purpose of discussion. Actual values and conditions must be acquired from the patent document specifications.
2. The "Range from Tm" is the temperature of the wash solution required to achieve the level of stringency and is dependent on the factors described above.
3. Values in Column 2 are typical only!
4. Values in column 3 were obtained from US 2006/0057724 A1
5. Values in Column 4 are typical only and sourced from the web. Refer to patent specifications for your details!
6. Definition of stringency levels may vary between patents (BEWARE). Stringency conditions are usually defined in the patent document!Sometimes the patent examiner will require the applicant to stipulate the stringency conditions: high/medium/or low stringency, or specifying the [NaCl] and temperature of the wash conditions. Few bother to give a reference with respect to Tm (a value that can be calculated). It is possible that the conditions defined as stringent in one patent document may not be as stringent as those defined in another!
Arabidopsis is an extremely important model system used to study many aspects of plant biology. From the graphs and figures presented in this chapter, it is obvious that there are a great many references to Arabidopsis in the scientific literature and in patent documents. These observations are consistent with the importance of Arabidopsis as a model organism for the study of plant biology. The interesting aspect of the appearance of "Arabidopsis" in the patent literature is the fact that Arabidopsis is NOT a commercial crop. Compared to important crops such as canola, soybean, and potato there would appear to be little value in patents directed towards Arabidopsis.
While the term "Arabidopsis" appears frequently in patent documents, it is the contents of the "claims" section of these documents that is most important. Since the focus of this landscape is towards the Arabidopsis genome, it is of interest to understand to what extent these claims are directed towards sequences and genes from Arabidopsis. Also of interest is an applicant's reasons for claiming sequences from an organism with little commercial value. In the following chapters we will analyse example patent applications and granted patents more closely to answer some of these questions.
Since patents containing claims to Arabidopsis sequences are the focus of this landscape, much of the analysis in the following chapters will involve patent documents making claims to large groups of genes and sequences from Arabidopsis. These patent documents will be described here as "bulk sequence" applications. For the sake of convenience and clarity, we will define "Bulk Sequence" applications as those that claim protection for 100-or-more Arabidopsis sequences. It is these patent documents that form the core of the following patent analysis.
Within the last decade there has been a trend towards attempts at patenting large groups of nucleic acid sequences from sequencing of EST libraries and genomes. There are “bulk sequence” claims over large portions of many genomes, including that of Arabidopsis. Since Arabidopsis was the first plant to have its genome sequenced, is a popular model organism, and there are extensive expressed sequence data (ESTs and cDNA sequences) available, examination of the patent landscape of Arabidopsis offers the opportunity to examine what may happen to other plant genomes.
Bulk sequence applications are important because they potentially cover genes involved in a very wide variety of processes in development, metabolism, and disease resistance. Hence it is a concern to examine the success of such patent applications in granting exclusionary rights to early applicants who can control what can be done with later research in these far-reaching areas.
The effects are not limited to those working on Arabidopsis. Arabidopsis is not a crop plant. If the economic gain to be had from patents on Arabidopsis genes was limited to royalties from Arabidopsis growers, there would be few patent applications pending on the genome of this plant. Most who work on it are using it as a model for important related species such as soybean and cotton. The claims in many of the bulk sequence applications described below would allow the rights-holders of patents on them to exclude all others from the use of genes not only in and from soybean and cotton, but many other leguminous food crops, fiber crops including trees such as Populus, fruit crops and ornamentals, and subsistence crops such as cassava and cowpea that are staples in the diets of millions of people.
Thus Arabidopsis offers a further dimension as a plant model: as an example of plant genome patenting.
Historically there have been a number of companies interested in the genomic sequences and expressed sequence tags (ESTs) from Arabidopsis. Many companies, including Paradigm Genetics Inc. (now Icoria Inc.), Ceres Inc., Mendel Biotechnology Inc., Syngenta, Dow Chemical, and BASF have applied for patents over Arabidopsis sequences.
The ownership of rights to Arabidopsis sequences is complicated by the pattern of mergers, acquisitions, and exclusive research licenses that give control of the intellectual property to entities different from those named as the patent applicants. In some cases when the ownership of a patent or patent applications changes hands, an assignment of rights is filed with the patent office, but in the U.S. there is no requirement that rights controlled through exclusive licenses be publicly recorded.
Monsanto has equity stakes in or controls the rights over the intellectual property resulting from exclusive research contracts with Mendel, Ceres and Paradigm/Icoria, as noted in press releases. Monsanto has actually acquired a portion of Paradigm/Icoria (Agrinomics) but in this acquisition and in the exclusive research agreements, information about the extent of the control over the intellectual property has not been made public. Monsanto has also had extensive historical ties to the Arabidopsis genome sequencing project, and has extensive bulk sequence claims over genes from other species such as cotton and soybean. Monsanto's efforts to patent ESTs from maize were rebuffed in a court case suggesting a higher standard for utility should be applied for sequence patents (re Dane Fisher case; See also this commentary link), but hundreds of thousands of sequences remain in patent applications over Arabidopsis as well as soybean, maize, rice, and other crops.
Syngenta also controls intellectual property claims through granted and pending patents nominally assigned to a number of companies that have been partially or completely acquired by Syngenta or its parent companies, and through research alliances. Nominal assignees of Arabidopsis intellectual property probably controlled by Syngenta include Torrey Mesa, Zeneca, Ciba Geigy, Novartis, Advanta, Garst, Agripro and Danisco.
See bar graphs showing the portions of the Arabidopsis genome that are covered by patent applications and granted patents that are assigned to the following companies and others with whom they have equity stakes and exclusive research alliances: DuPont, Monsanto, Paradigm Genetics, and Syngenta.
Much of the focus of patenting attempts and granted patents for Arabidopsis genes (and those of many other organisms) have been towards patent protection in the US. Other countries, such as Australia, often use the US as a model for guiding their own patent office procedures. Hence, when considering the legal aspects of bulk sequence claims, we will be referring to the current state of affairs in the US and in particular the actions of the United States Patent and Trademark Office (USPTO) towards bulk sequence applications. To understand the basis for patent protection in the US, one must understand that:
|
A patent applicant is entitled to a patent on an invention if it is novel, inventive, has utility, and if the patent disclosure enables its use. |
The default, with respect to applications, is that the applicant is entitled to the patent, and the applicant is given the opportunity throughout the examination process to edit the claims of the application to maximise the chance that a patent is issued. Hence the onus is on the patent office to prove that a patent application fails to meet the criteria for patentability. Patent examiners have a limited amount of time to spend examining an application.
It can be argued that such bulk sequence applications represent attempts to patent "discoveries" (which are not patentable) rather than "inventions". Specifically, the applicant has added to the amount of human knowledge by disclosing something that occurs in nature, rather than adding to human knowledge through the creation of something new (a useful invention). Therefore, it could be argued that bulk sequence applications lack an inventive step.
For an application that claims many sequences, applicants have historically argued that novelty, inventiveness, and utility are met by such large sequence applications. They have done so by pointing out that such sequences can be used (inventiveness and utility) for particular purposes, including:
These proposed uses bring to light another problem with such groups of sequences: Enablement.
The disclosure of information made in "payment" for the grant of a patent must enable a person "skilled in the art" to use the invention. A bulk sequence application may fail to adequately address this issue, since it would be extremely difficult for any applicant to give specific instructions on how to use all the sequences for the purposes listed above.
The USPTO has responded to attempts to patent large groups of sequences by strictly adhering to the requirement for enablement and substantial utility for such applications.
Recently, the USPTO's commitment to these standards were challenged by Monsanto, who had previously applied for patent protection for 32,236 maize ESTs (US patent application 09/619,643), and were refused a patent. Monsanto appealed the USPTO's decision in the Court of Appeals for the Federal Circuit. The appeal was denied by a majority of 2 out of 3 judges, based on a lack of utility and enablement (re Dane Fisher case; See also this commentary link).
Since the re Fisher case was not a unanimous decision, a question still remains as to whether-or-not future large sequence claims will be successful.
Another important point to note about granted patents:
|
Under US law a granted patent is assumed to be valid. |
This fact, in addition to the high cost of contesting a granted patent (generally around one million US dollars), provides an effective barrier to those interested in challenging a patent's validity.
Bulk sequence applications have (and are still) used to make claims to large numbers of sequences from an organism (including many animals, bacteria, and plants). There is no upper limit to the number of sequences that can be claimed within a single patent application within the US system. Some plant applications may contain more than 100,000 sequences. Hence the success of even a single bulk sequence application would mean that a large portion of a plant genome would be protected by patent claims. Another interesting point is that it is common for companies to license technologies that have a "patent-pending" status (i.e. that have not yet received a granted patent). Hence the protection that might be derived from a granted patent is incentive enough to provide protection in itself. Therefore patent applications (including bulk sequence applications) are also important when considering issues of deliverability, innovation and freedom-to-operate.
Whilst Monsanto's efforts to patent large numbers of maize ESTs were recently unsuccessful, examples of granted bulk sequence patents exist, and applicants continue to pursue bulk sequence applications at the USPTO.
|
Are there bulk sequence applications for Arabidopsis ? |
|---|
In the following chapters we will investigate bulk sequence applications for Arabidopsis genes.
Mendel is a US company located in Hayward, California. Deriving its name from the "father of genetics", Gregor Mendel, the company was founded in early 1997 with the idea that controlling gene expression would lead to new ways of manipulating plant growth and development.
Mendel's main interest in Arabidopsis is the 1,800-or-so transcription factors that control the expression of ~27,000 Arabidopsis genes. Accordingly, Mendel has applied for patents over a large number of Arabidopsis transcription factors. Presumably this strategy relies on the fact that control of expression of genes via transcription factors is one way in which useful new phenotypes may be engineered into flowering plants (including many agronomically important crops). Since transcription factors are expected to play important roles in plant development, it is expected that Arabidopsis transcription factors may be functionally and structurally similar to, and highly homologous with transcription factors of other flowering plants. Thus a claim on Arabidopsis transcription factors may also effectively be a claim over the same gene in many other flowering plants.
RNA polymerases (RNAPs) are enzymes involved in the process of transcription (transcribing an RNA molecule using DNA as a template). While prokaryotic RNAP can recognise gene promoters and initiate transcription from the proper start point, the efficiency of the process is further controlled by trans-acting proteins called Transcription Factors (TFs). In both eukaryotes and prokaryotes, the process of transcription is tightly regulated for many genes and one way in which the cell is able to control transcription is through the use of TFs. In eukaryotes RNAP II requires the presence of a group of auxiliary proteins (general Transcription factors) to be able to recognise gene promoters. Hence production of mRNA in eukaryotes requires the presence of general TFs, and may also be further controlled by the presence of gene-specific TFs.
TFs are proteins that bind to specific DNA sequences within the promoter or enhancer regions of a gene and in so doing are able to control the transcription of the gene. They may either decrease (repressors) or increase (activators) the level of transcription. They are often involved in the final step in a signal transduction pathway and can be activated or deactivated by other proteins higher in the pathway. TFs can be recognised from EST and genomic DNA sequencing projects based on the fact that they often contain sequence motifs such as: zinc fingers, helix-turn-helixs, and basic-helix-loop-helix (for DNA binding) and leucine zippers (for interaction between TFs).
We have chosen one patent application by Mendel Biotechnology as an example of its patenting strategy towards transcription factors. The patent application US 2004/0019927 A1 attempts to claim a large group of Arabidopsis transcription factors and related sequences in plants. This is the largest of the Mendel patent applications for bulk sequences that we were able to find (although there may be more), others also exist that claim either individual or small groups of TFs. Analysis of granted TF patents are discussed in Chapter 7.
HAYWARD, Calif., July 10 /PRNewswire/ -- Mendel Biotechnology, Inc.
("Mendel") today announced a new research and commercialization partnership with
Monsanto Company ("Monsanto"). The new program represents a significant
expansion of the ongoing collaboration between the companies and extends
the agreement through 2011. Mendel and Monsanto have agreed not to disclose
financial details of the agreement.
Monsanto and Mendel will continue to collaborate on the commercialization by
Monsanto of technologies developed by Mendel. In addition, the
companies will initiate a systems biology program to develop an integrated
framework for predictive control of plant gene expression that is anticipated
to streamline discovery and product development activities...
This new partnership extends the relationship initiated in 1997, under which
Mendel executed a functional genomics program in the model plant
Arabidopsis focused on understanding the factors that control plant gene
expression. That program identified a number of genes that have been shown to
enhance the yield of major crops in extensive field trials and that are in early
commercial development by Monsanto.
Press Release Source: Mendel Biotechnology, Inc. July 10, 2006
"Four years ago, Chris Somerville, head of plant biology at the Carnegie Institution of Washington, slipped a gene for making plastic into Arabidopsis, a type of mustard plant. The gene turned the plant into a biological plastics factory. Now, Monsanto Co. scientists are turning the concept into commercial reality....Just as exciting is a recent discovery by Calgene [now owned by Monsanto] scientists of the gene for the enzyme controlling the formation of cellulose in plants. After 30 years of fruitless biochemical search for the enzyme, ''this is our first break in understanding how to control biomass,'' Somerville explains. Genetically boosting the enzyme could make it possible to create trees with much higher proportions of cellulose--the plant kingdom's structural fiber--and less than the normal amounts of other cell wall components. Because these secondary components are what make the pulp- and papermaking process polluting and inefficient, scientists say, the engineered trees could help clean up a major industry. Beyond that, ''there is a mad scramble inplant biology to find the most useful genetic sequences,'' says Somerville. ''The world hasn't even seen the tip of the iceberg.'’ ‘“THE BIOTECH CENTURY”, Business Week, 1997 Chris Somerville is still at the Carnegie Institution, but is also the CEO of Mendel.

This diagram identifies a locus in the Arabidopsis genome with many patent applications covering genes in a pathway important for biofuels. Mendel's patenting (and that of other large companies such as DuPont) is not only over crops critical to worldwide food security, but also industrial uses that could add to the prosperity of developing countries if they were available to exploit. Who gets to exploit them?
www.mendelbio.com/news/
Moderna, Monsanto link up in new venture
FOCUS - Moderna, Monsanto link up in new venture Reuters, Monday, November 24, 1997 at 20:09 By David Luhnow
MEXICO CITY, Nov 24 (Reuters) - Mexican conglomerate Empresas La Moderna (MEX:MDA.A) and U.S. agricultural giant Monsanto Co (NYSE:MTC) on Monday said they had bought a stake in newly formed agricultural genetic research company Mendel Biotechnology Inc. Each company paid $15 million to fund a five-year research and development project at Mendel, a start-up firm based in San Francisco and headed by some of the world's leading plant genome experts, according to La Moderna officials in Mexico. In return, Moderna and Monsanto each get a 20 percent equity stake in Mendel and rights to develop and commercialize Mendel's technical capabilities in the next generation of value-added agricultural products.
Both companies also have an option to buy an additional 10 percent stake in Mendel at an undisclosed future date, La Moderna company spokesman Dieter Holtz told Reuters. Each firm will have rights to Mendel's research in the areas they are interested in and dominate, officials said. Monsanto will focus on Mendel's advances in agronomics, or the capabilities in plant genetics and genomics for many crops, including corn, soybeans, fruits and vegetables. "This collaboration gives us the abilty to better understand the function of specific genes in plants, thereby allowing us to more quickly introduce crops with improved agricultural traits," said Ganesh Kishore, Monsanto's assistant chief scientist and chief biotechnologist. Kishore said the venture will greatly reduce research and development time for products that increase yield or otherwise enhance the growing, processing or nutritional characteristics of food.
La Moderna, the world's largest vegetable seed maker, will have access to Mendel's research in vegetable seeds as well as fresh and processed fruits and vegetables. "We will each focus on our core businesses: Monsanto in agronomics and (its herbicide) Roundup and Moderna in vegetable seeds and fruits and vegetables," Holtz said in a telephone interview.
Earlier this year, Moderna sold its Asgrow agronomics division to Monsanto for about $240 million in order to focus more on the Mexican company's strength in vegetable seeds. Mendel's president and CEO is Michael Fromm, a former research director at Monsanto, and its chairman of the board is Christopher Sommerville, director of the Department of Plant Biology at the Carnegie Institution of Washington. "These are the best of the best in the field," Holtz said. The top researchers at Mendel were responsible advances in areas such as disease resistance, the molecular genetics of nitrogen fixation in plants and the genetics of photosynthesis, he added.
"We didn't want to have majority control of the company because
we didn't want to cramp their entrepreneurial spirit. We thought it was better
to see what they came up with on their own," Holtz said.
Copyright 1997, Reuters News Service
Monsanto, Empresas in Joint Venture with Mendel
MONSANTO, EMPRESAS IN JOINT VENTURE WITH MENDEL Reuters, Monday, November 24,
1997 at 16:22 ST. LOUIS, Mo., Nov 24 (Reuters) - Monsanto Co (NYSE:MTC) and
Empresas La Moderna S.A. (MEX:MDA.A) said Monday they signed a broad
technology collaboration agreement with Mendel Biotechnology Inc in the field of
agricultural functional genomics.
The agreement gives Monsanto and Empresas exclusive access to Mendel's technical capabilities in plant genetics and genomics for many crops, including corn, soybeans, fruits and vegetables.
As part of the agreement, Monsanto and Empresas will each acquire a substantial minority equity interest in Mendel and fund a research and development program over five years. Financial terms of the agreement were not disclosed. "This collaboration gives us the abilty to better understand the function of specific genes in plants, thereby allowing us to more quickly introduce crops with improved agricultural traits," said Ganesh Kishore, Monsanto's assistant chief scientist and chief biotechnologist.
Kishore said the venture will greatly reduce research and development time for products that increase yield, or otherwise enhance the growing, processing or nutritional characteristics of food.
Copyright 1997
Monsanto and ELM Enter Into Joint Technology Venture With Mendel Biotechnology
PR Newswire, Monday, November 24, 1997 at 19:46 ST. LOUIS, Nov. 24 /PRNewswire/ -- Monsanto Company and Empresas La Moderna, S.A., (ELM) jointly have signed a broad technology collaboration agreement with Mendel Biotechnology Inc. in the field of agricultural functional genomics. The agreement gives Monsanto and ELM exclusive access to Mendel's technical capabilities in plant genetics and genomics for many crops, including corn and soybeans, and fruits and vegetables. Leaders in the plant genetics and genomics fields to identify the function of genes founded Mendel Biotechnology and patent the corresponding DNA sequences that will produce the intellectual property basis for the next generation of agricultural products created through biotechnology.
As part of the agreement, Monsanto and ELM will each acquire a substantial minority equity interest in Mendel and fund a research and development program over a five-year period. Financial terms of the agreement were not disclosed.
"This collaboration gives us the ability to better understand the function of specific genes in plants, thereby allowing us to more quickly introduce crops with improved agricultural traits," said Ganesh Kishore, assistant chief scientist and chief biotechnologist of Monsanto. Kishore said the venture will greatly reduce research and development time for products that increase yield, or otherwise enhance the growing, processing or nutritional characteristics of food.
"This is our first venture in genomics, and it builds nicely on our original technology agreement with Monsanto as a preferred provider of agronomic and quality traits we're already using in our fruit and vegetable seed and produce businesses," said Alfonso Romo Garza, chairman and chief executive officer of ELM. Through this partnership, ELM will have exclusive rights to Mendel's technology for the development of proprietary, transgenic fruits and vegetable products that create value for growers, processors and consumers.
Mendel's founding partners include several of the world's most renowned plant geneticists. Their scientific contributions include the isolation of genes responsible for disease resistance, the molecular genetics of nitrogen fixation in plants, the genetics of photosynthesis and plant oil biosynthesis, the development of synthetic plastic in plants, and the gene transfer technology for corn.
"Applying cutting-edge genomics capabilities to the agricultural biotechnology strengths these two leaders have developed is a natural next step in discovering and commercializing improved-trait crops," said Christopher R. Somerville, chairman of Mendel. Somerville also is the director of the Department of Plant Biology at the Carnegie Institution of Washington, and professor of the Department of Biological Sciences at Stanford University. The president and chief executive officer of Mendel Biotechnology is Michael Fromm, a former research director of enabling technologies and plant genomics for Monsanto.
As a life sciences company, Monsanto is committed to finding solutions to the growing global needs for food and health by sharing common forms of science and technology among agriculture, nutrition and health. ELM is a leading agribusiness biotechnology company focused on developing and marketing premium branded vegetable seeds, as well as fresh and processed fruits and vegetables. Mendel, a newly formed biotechnology company, specializes in applying functional genomics techniques to create valuable new traits in plants.
SOURCE: Monsanto Company 11/24/97
CONTACT: Lori J. Fisher, 314-694-8535
Lori.J.Fisher@monsanto.com
Since Mendel Biotechnology Inc. is known to have interests in Arabidopsis, the company name was used as a search term to uncover new patent documents.
The Mendel Biotechnology Inc. patents discussed in this section were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
18 05 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(arabidopsis AND thaliana) AND (Mendel AND Biotechnology) |
|
Results |
161 |
|
Comments |
Of the 161 results identified using these search terms, 11 results were identified as having bulk sequence claims based on the content of their titles, abstracts, and claims. The correspondent or applicant named on all these applications is Mendel Biotechnology Inc., 21375 Cabot Blvd., Hayward, CA, 94545. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
As a precaution, a further search was performed: this time no assignee/applicant/correspondent names were used. Instead the strategy conducted was:
|
Search details |
|
|---|---|
|
Date of search |
14 06 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(mendel AND biotechnology) AND ((SEQ AND ID) in claims) |
|
Results |
186 |
|
Comments |
Of the 186 results identified using these search terms, more than 10 were found to claim large numbers of Arabidopsis sequences (mostly transcription factors). The remainder were also of interest since they appeared to also claim smaller groups of transcription factors. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
The following is an example of one of the 11 patent documents identified as belonging to Mendel Biotechnology Inc. The '927 application is discussed here for the following reasons:
(Note that this patent application contains by far the most sequence claims compared to other applications. Note also that there are ~1800 TFs in the Arabidopsis genome and there are >1900 SEQ IDs listed in this application alone! Hence this application is likely to be a significant patent application with respect to TFs for many plant species.)
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||
|---|---|---|---|---|---|
THIS IS NOT A GRANTED |
Title - Polynucleotides and polypeptides in plants
The claims are generally drawn to:
A transgenic plant comprising the SEQ IDs listed above, including polynucleotides as in (a), transcription factors with conservative substitutions from (a), transcription factors at least 80% identical to (a) or (b), orthologous or paralogs at least 80% identical to (a) or (b), polynucleotides containing conserved domains at least 80% identical to conserved domains in (a) or (b), and polynucleotides that hybridise to those in (a) or (b) under stringent conditions.
An expression cassette comprising, a promoter of (1) above, and containing a polynucleotide [as described above for claim 1]
This time the claim is towards a method for producing a modified plant comprising a polynucleotide [as described above for claim 1] The claims above are defined with reference to the sequences defined by SEQ ID numbers. One interesting question here is the degree to which an examiner is able to effectively search for prior art. It is not unreasonable to expect that the details of one or more transcription factors having at least 80% identical sequence to one or more of the SEQ IDs above may already be published in the literature or in another patent document. |
Original Applicant: Mendel Biotechnology, Inc. , 21375 Cabot Blvd., Hayward, CA, 94545, For information concerning this application, we would suggest contacting either: MATTHEW KASER Mendel Biotechnology, Inc. , or: Dr. William Goure, |
|||
| Remarks |
Again, as with the Paradigm patent applications mentioned above, it is unlikely that this application would result in a single granted patent for all of these sequence claims. However, through numerous additional applications (continuations, CIPs and divisionals) it is entirely possible that any of the sequences listed may evolve into separate applications and may eventually be granted as patents. Such a process may take many years. |
||||
|
Other interesting Mendel Biotechnology Inc. patent documents include |
|||||
|
Document |
Title |
DNA Sequences |
|||
|
Polynucleotides for seed trait alteration |
74 |
||||
|
Plant gene sequences I |
85 |
||||
|
Stress-induced polynucleotides |
111 |
||||
|
Plant gene sequences II |
104 |
||||
|
Gene sequences and uses thereof in plants |
368 |
||||
| Yield-related genes |
35 |
||||
| Genes for modifying plant traits IV |
232 |
||||

US 2007/22495
This Mendel patent application, one of a family of, to date, over 180 patent applications filed in Australia, Brazil, Canada, Japan, Mexico, the PCT, and possibly other jurisdictions that do not report to INPADOC, relates to polynucleotides and polypeptides associated with transcription factors from A. thaliana. Note how non-specific the utility is: particularly, the transcription factors are those responsible for modifying phenotypes of plants. Very broad language is used for the phenotypes in claim 1b below; a patent examiner could rightly question whether applicant has described how any one of the sequences of part (a) of the claim relate to any one of the phenotypes in part (b) of the claim, because on its face the claim is to any plant having any characteristic listed in claim 1b and containing any sequence, regardless of source, that is somewhat like any of the sequences in claim 1a.
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||
|---|---|---|---|---|---|
|
Title - Transcription factors for increasing yield
|
Mendel Biotechnology, Inc. |
|||
|
Remarks |
This application is a Continuation-in-part of application No. 09/713994, filed on 16 Nov 2000, now abandoned. |
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Assignee |
|||
|---|---|---|---|---|---|
|
Title - Biosynthetic pathway transcription factors
|
Mendel Biotechnology Inc. |
US 6946586
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Assignee |
|||
|---|---|---|---|---|---|
US 6946586 B1
|
Title - Genetic trait breeding method
|
Mendel Biotechnology Inc. |
US 7193129
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Assigne |
||
|---|---|---|---|---|
US 7193129 B2
|
Title - Stress-related polynucleotides and polypeptides in plants
|
Mendel Biotechnology Inc. |
US 7196245
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Assigne |
|||
|---|---|---|---|---|---|
US 7196245 B2
|
Title - Polynucleotides and polypeptides that confer increased biomass and tolerance to cold, water deprivation and low nitrogen to plants
|
Mendel Biotechnology Inc. |
US 7223904
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Assigne |
|
|---|---|---|---|
US 7223904 B2
|
Title - Plant gene sequences II
|
Mendel Biotechnology Inc. |
The following is an example of one of the granted patents for transcription factors from Mendel Biotechnology Inc. As previously noted, the patent is part of a much larger family of documents containing applications claiming many Arabidopsis sequences, many of which are still pending in many countries:
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||
|---|---|---|---|---|---|
|
Title - Transgenic plants comprising polynucleotides encoding transcription factors that confer disease tolerance
|
Original Applicant: Mendel Biotechnology, Inc., Hayward, CA., USA |
|||
|
Remarks |
SEQ ID NO 18 is a transcriptional activator (AtERF-1; ethylene responsive element binding factor 1). Note that the claims issued in this patent are far narrower than those of the patent application originally filed, which was discussed above. The narrowing of claims probably reflects the existence of sequences that had already been disclosed which would have been infringing the overly broad claims of the patent application. Note, however, that claims to all the other sequences originally claimed could still be pending. The scope of this claim is limited to comprising language around the peptide sequence disclosed in SEQ ID NO 18. It may be possible to achieve the same or similar phenotye modifications that expression of AtERF-1 would cause in plants by the use of a homologous sequence from elsewhere in the same genome or from another species. However, it is not truly clear how much there is freedom to operate. How far does the doctrine of equivalents provide protection in this case? Would an AtERF-1 homologue less than 90% identical at the amino acid level be a work around in this case? Also, is "tolerance" the same as disease resistance? Note also that fungal disease is specifically mentioned in the claims of the issued patent. However, this does NOT imply that the applicants could not obtain claims over bacterial or viral diseases or any of the other phenotypes mentioned in the applications still pending. |
This patent application is related to nucleotide sequences encoding plant transcription factors from soybean, rice, maize, barley, wheat and other plants as well as Arabidopsis thaliana. Like the transcription factor patent application shown before, it is part of a family of over 180 patent applications filed in Australia, Brazil, Canada, Europe, Japan, the USA and the PCT as well as possibly in other jurisdictions that do not report to INPADOC.
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|
|---|---|---|---|
|
Title - Polynucleotides and polypeptides in plants
|
Mendel Biotechnology, Inc. |
|
|
Remarks |
A corresponding patent (US 7196245) has been issued, in which claims on only two sequences were granted for a WRKY DNA-binding domain. This was also filed in Australia, Brazil, Canada, Japan, Mexico and the PCT. |
The following are US patents that have issued from the above-described Mendel patent applications on "Polynucleotides and Polypeptides from Plants". Mendel, and probably its research ally Monsanto, has the right to exclude anyone from using technology covered by the claims below in the US or in products to be imported into the US. Note that while the claims are narrower than those in the corresponding patent application, this is no cause for feeling relieved about how many research avenues they leave open for development into deliverable products.
For example, in the patent 6,717,034 shown below, the scope of the claims is towards transgenic plants having increased biomass (compared to a control), and the methods involved in producing them. The scope is further limited with respect to the sequence used to obtain this phenotype. Specifically the DNA sequence in SEQ ID 1 is used, along with the peptide sequence in SEQ ID 2, to define the scope in plants. Claims include DNA sequences (and complementary sequences) that code for SEQ ID NO: 2 (including SEQ ID NO 1), and sequences that "bind under stringent conditions" to SEQ ID 1 (and complementary sequences). Such claim language opens the possibility of using other transcription factors that might result in similar useful phenotypes (e.g. would the equivalent transcription factor from another dicot, that codes for a peptide sequence other than SEQ ID NO 2, and is sufficiently different to SEQ ID NO 1, be a possible work-around for this patent?) However, before making such a conclusion it would be important to examine the prosecution history, i.e. the interactions between the patent office and the applicant, about how the doctrine of equivalents would need to be taken into account).
Also, this patent is one of a large INPADOC family, and broader claims to the same sequences, as well as to all the other sequences in the application, could still be pending. This means that at any time patents with broader claims could still issue from any of the over 100 patent applications that are still pending.
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||||
|---|---|---|---|---|---|---|---|
|
Title - Method for modifying plant biomass
|
Original Applicant: Mendel Biotechnology, Inc., Hayward, CA., USA |
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Assignee |
|||
|---|---|---|---|---|---|
|
Title - Biochemistry-related polynucleotides and polypeptides in plants
|
Mendel Biotechnology Inc. |
Mendel's interest in transcription factors of Arabidopsis are clearly demonstrated in their patenting strategy. Like Paradigm Genetics, Mendel's early patent applications contain claims to large groups of sequences. However, unlike Paradigm, Mendel's patenting activities have extended beyond the initial bulk sequence claims. A large and complex patent family has arrisen from a small number of initial patent applications. Unlike Paradigm's applications, the '927 application discussed above is the result of a series of continuations of previous applications. The '927 application has itself led to still further continuation applications. Many of the Mendel applications are still active, and even those that have abandoned or expired have led to further applications. Unlike Paradigm, Mendel has granted patents for some transcription factors and methods relating to transcription factors (e.g.US 6664446 , US 6835540, and US 6717034).
Mendel has invested significant resources in developing this patent document family. It is likely that Mendel will continue this strategy to eventually obtain patents on more transcription factors from Arabidopsis. The language of the granted claims is crucial in determining whether claims over other dicot species will be possible. The recent granted patent, US 6717034 (towards a transcription factor involved in modifying plant biomass), has the following claim language:
|
Claim #3: A transgenic plant comprising a recombinant polynucleotide comprising a polynucleotide sequence that hybridizes over its full length under stringent conditions to: (a) a nucleotide sequence comprising SEQ ID NO: 1, or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1; or (b) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2, or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2; wherein the stringent conditions comprise wash conditions of 0.2×SSC to 2.0×SSC, 0.1% SDS at 60-65° C. and wherein expression of the polynucleotide sequence that hybridizes to either (a) or (b) increases the plant's biomass as compared to a control plant not transformed with said recombinant polynucleotide. |
|---|
Such language is more limiting in scope than that of the '927 patent. However, it is not unreasonable to expect that such hybridization conditions may be met by at least homologs from other species within the mustard family. Whether-or-not homologs from other important crop species meet the same criteria cannot easily be deterimined, and would require experimental analysis. In part, this uncertainty evolves from the use of hybridization language in the claims. Such claims, based on hybridization conditions, are difficult to quantify and the huge cost of possible litigation may act as a barrier to those interested in developing IP around transcription factors in other dicots.
To demonstrate the complexity of the patent document family that Mendel has created, consider the following diagram (the '927 application family):
![]() |
|
(Hexagons: represent pending applications; Diamonds: represent adandoned or expired applications; F = filing date; CIP = continuation-in-part application) |
Broad claims in Applications:
Arguebly, it is the role of a good patent attorney to provide the best
possible outcome for a client. It is probably for this reason that the scope of
initial claims in a patent application is often as wide as possible. Such broad
intent is demonstrated in the text of the '927 application, where the following
is stated:
| Sequences homologous, i.e., that share significant sequence identity or similarity, to those provided in the Sequence Listing (except CBF sequences SEQ ID NOs: 1955-1960), derived from Arabidopsis thaliana or from other plants of choice, are also an aspect of the invention. Homologous sequences can be derived from any plant including monocots and dicots and in particular agriculturally important plant species, including but not limited to, crops such as soybean, wheat, corn (maize), potato, cotton, rice, rape, oilseed rape (including canola), sunflower, alfalfa, clover, sugarcane, and turf; or fruits and vegetables, such as banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, pineapple, pumpkin, spinach, squash, sweet corn, tobacco, tomato, tomatillo, watermelon, rosaceous fruits (such as apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, cauliflower, Brussels sprouts, and kohlrabi). Other crops, including fruits and vegetables, whose phenotype can be changed and which comprise homologous sequences include barley; rye; millet; sorghum; currant; avocado; citrus fruits such as oranges, lemons, grapefruit and tangerines, artichoke, cherries; nuts such as the walnut and peanut; endive; leek; roots such as arrowroot, beet, cassaya, turnip, radish, yam, and sweet potato; and beans. The homologous sequences may also be derived from woody species, such pine, poplar and eucalyptus, or mint or other labiates. In addition, homologous sequences may be derived from plants that are evolutionarily-related to crop plants, but which may not have yet been used as crop plants. Examples include deadly nightshade (Atropa belladona), related to tomato; jimson weed (Datura strommium), related to peyote; and teosinte (Zea species), related to corn (maize). |
The importance of using hybridisation and sequence similarity language to broaden the scope of claims can be inferred from the following two figures from the '927 application. These figures relate the evolutionary position of Arabidopsis to other important crops. One may suspect that those crops most closely related to Arabidopsis would be most likely to have genes with highly homologous sequences (and thus satisfy the hybridisation and/or similarity ranges stated in the claims):
|
![]() |
Paradigm Genetics Inc. (Research Triangle Park, N.C., US) was founded in 1997 and went public in 2000. In its early years Paradigm's business statements emphasised making use of genomics information obtained from Arabidopsis, for extensions into crop plants via agricultural biotechnology. During this time it sequenced a large number of Arabidopsis ESTs.
In 2004 Paradigm changed its name to Icoria Inc. Icoria moved into medical biotechnology and sold assets relating to its agricultural genomics business to Monsanto Co. In the period 1997-2004 Monsanto had extensive ties to the Arabidopsis genome sequencing project, and extensive bulk sequence patent filings in crop plants (e.g. rice, maize and cotton), although because of non-publication requests by Monsanto and other procedural matters, many of these were not known to the public until 2004 or later (see for example US 2004/31072, which claims 285,684 sequences mostly from soybean, with applications in many other related crop plants).
We have been able to find 9 patent applications with Paradigm Genetics Inc. as the applicant or correspondent. Since the 2004 acquisition these may now be assigned to Monsanto, although USPTO assignment information provided by the applicants is not always kept up to date.
Since all 9 patents are very similar, and each claim ~1000 ESTs from Arabidopsis, we have chosen one, US2001/44940 A1 (the '940 application) to discuss below.
See bar graphs showing the portions of the Arabidopsis genome that are covered by patent applications and granted patents that are assigned to Paradigm Genetics Inc.
Since the major players in patenting of Arabidopsis sequences are known (from press releases and biotechnology publications), company names can be used in searches. In this case Paradigm Genetics was used as part of a search strategy.
The Paradigm Genetics Inc. patents discussed in this section were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
18 05 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(arabidopsis AND thaliana) AND (paradigm AND genetics) |
|
Results |
361 |
|
Comments |
Of the 361 results identified using these search terms, 9 results were identified as having bulk sequence claims based on the content of their titles, abstracts, and claims. The correspondent named on these applications is PARADIGM GENETICS, INC., 104 ALEXANDER DRIVE, BUILDING 2, P O BOX 14528, RTP, NC, 277094528. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
As a precaution, a further search was performed: this time no assignee/applicant/correspondent names were used. Instead the strategy conducted was:
|
Search details |
|
|---|---|
|
Date of search |
14 06 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) |
|
Results |
3,824 |
|
Comments |
The results were sorted by relevance score, and the top 300 patent documents were investigated. Whilst new and interesting Arabidopsis patents appeared, no-other Paradigm applications or patents covering bulk sequence applications could be identified (although some may have been missed by this search). You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
The following is an example of one of the 9 patent applications by Paradigm Genetics Inc (now Icoria Inc.). The '940 A1 application has the earliest filing date of the 9 applications. All 9 applications are listed as "abandoned" on the USPTO website database. However, one must remember that "abandoned" does not necessarily mean that Icoria or now Monsanto Co. have actually given up their attempts at patenting all or part of the claims. It may still be possible that future continuations, continuations-in-part, or divisional applications might appear as patent document publications.
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
||||||
|---|---|---|---|---|---|---|---|---|
|
Title - Expressed sequences of Arabidopsis thaliana
The claims are generally drawn to:
Essentially, this claims the ESTs themselves, any homologues, allellic variants,..etc. and may cover the equivalent gene in other organisms. The "comprising" language in this claim makes the claim's scope so broad that there almost certainly will be existing prior art. Stringent hybridisation conditions are defined as "50°C or higher and 0.1×SSC". "A fragment thereof" is not clear in itself and could imply a 2bp sequence! The application arguably provides the following guidance "Preferably, hybridization is performed using at least 15 contiguous nucleotides...".
This essentially claims DNA constructs or vectors containing the ESTs, or part of the ESTs, or their homologues, and any sequences from other organisms that satisfies the stringency conditions above (since no organism limits are made here!). It is hard to imagine that none of these sequences would hybridize to DNA from other organisms (even human sequences!). How can an examiner be certain that any of these sequence claims are not invalidated by prior art (particularly other bulk sequence claims for rice and maize ESTs??!)
This time the claim is towards a transgenic plant transformed with the claimed DNA. Arguably, the "operably linked" phrase may allow a work around of at least part of this claim: by the use of transactivation technology (see PCT publication WO 01/21781).
Similar to the previous claim, for cells transformed with the claimed DNA. Again, consider transactivation technology and PCT publication WO 01/21781. Interestingly, the qualifier "plant" is not used to describe the cell. Does this imply that this claim covers animal, fungal, and protist cells, whereas the previous did not cover transgenic animals?
Attempts to cover use in microarray technologies. What does stably bound mean? What if the bond is reversible?
As above for DNA arrays, in this case for peptide arrays. However, no mention is made of peptide length - this could mean anything from full length peptides to 2-amino acids! (There is no specification for this in the application) |
Original Applicant: PARADIGM GENETICS, INC., Which recently became: Icoria Monsanto has since acquired selected agricultural genomics assets from Icoria: Monsanto Company We would suggest that interested parties should try to contact: Charles W Burson Executive Vice President, General Counsel, and Secretary
Telephone: 314-694-1000 |
||||||
| Remarks |
Given the broad scope of the claims above, it is not expected that Paradigm's 9 bulk sequence applications would produce a granted patent over all the sequences listed. Instead Paradigm's goal is probably to establish a priority date for all the sequences, and the opportunity to select from this group sequences that may become valuable and can be patented in divisional, CIPs, and continuation applications in the future. This unfortunately introduces uncertainty for those working on these or related genes. Since it is virtually impossible to tell which genes will be the focus of future active applications. It is also possible that future efforts by others to study the function of these genes may produce results that influence's Paradigm's/Icoria's/Monsanto's decision on which gene(s) to continue patenting efforts. Future research that shows gene 'X' to be useful (valuable) might be sufficient to initiate further applications for that gene in divisional applications. |
|||||||
|
The remaining 8 applications are: |
||||||||
|
Application No. |
Document Title |
Sequences Claimed |
||||||
| US 2002/23280 |
Expressed sequences of arabidopsis thaliana |
999 |
||||||
| US 2002/23281 |
Expressed sequences of arabidopsis thaliana |
999 |
||||||
| US 2002/40489 |
Expressed sequences of arabidopsis thaliana |
999 |
||||||
|
Expressed sequences of arabidopsis thaliana |
999 |
|||||||
|
Expressed sequences of arabidopsis thaliana |
999 |
|||||||
|
Expressed sequences of arabidopsis thaliana |
999 |
|||||||
|
Expressed sequences of arabidopsis thaliana |
900 |
|||||||
|
Expressed sequences of arabidopsis thaliana |
999 |
|||||||
The 8 remaining applications are:
|
Publication No. |
Application No. |
Filing Date |
Publication Date |
Document Title |
Sequences Claimed |
|---|---|---|---|---|---|
| US 2002/23280 |
60/178,502 |
26 Jan 2001 |
21 Feb 2002 |
Expressed sequences of arabidopsis thaliana |
999 |
| US 2002/23281 |
60/178,472 |
26 Jan 2001 |
21 Feb 2002 |
Expressed sequences of arabidopsis thaliana |
999 |
| US 2002/40489 |
60/178,503 |
26 Jan 2001 |
04 Apr 2002 |
Expressed sequences of arabidopsis thaliana |
999 |
|
60/178,512 |
26 Jan 2001 |
04 Apr 2002 |
Expressed sequences of arabidopsis thaliana |
999 |
|
|
60/178,506 |
26 Jan 2001 |
16 May 2002 |
Expressed sequences of arabidopsis thaliana |
999 |
|
|
60/178,480 |
26 Jan 2001 |
23 May 2002 |
Expressed sequences of arabidopsis thaliana |
999 |
|
|
60/148,784 |
07 Aug 2001 |
03 Oct 2002 |
Expressed sequences of arabidopsis thaliana |
900 |
|
|
60/178,466 |
26 Jan 2001 |
19 Jun 2003 |
Expressed sequences of arabidopsis thaliana |
999 |
Paradigm Genetics Inc. was essentially a start-up company, created to make use of (and patent) DNA and EST sequences generated from Arabidopsis. By making use of the new technology of EST sequencing it could potentially gain patents over a large number of Arabidopsis genes. Due to the language of the patent claims:
"...hybridizing under stringent conditions to a sequence set forth in..."
it is entirely likely that successful claims would "read on" large numbers of related genes from other dicot species. In particular those that have agricultural importance: potato, soya, canola,...
All 9 patent applications of Paradigm are classified as "abandoned" in the USPTO's database PAIR. The assignee of these applications at the USPTO is still listed as Paradigm Genetics. Note also the fact that these are US applications, and as far as we are able to tell, were not the focus of applications elsewhere in the world.
Unfortunately, although classified as "abandoned" it is still too early to say for certain that these applications are no longer "active" for the following reasons:
It is unlikely that, in their present state, patents would be granted on these applications, since the claims appear to be very broad. However, Paradigm's applications may demonstrate one early step in a bulk sequence patenting strategy:
|
Apply for a patent over as many sequences as possible and establish the earliest possible priority date. |
|---|
This strategy may have the following benefits:
By creating such important potential IP, a small company such as Paradigm became potentially valuable to larger companies such as Monsanto, who have large agricultural biotechnology interests. The decision by Paradigm to move into human health biotechnology, was almost certainly facilitated by funds generated in the sale of agricultural assets to Monsanto. In turn, Monsanto acquired potentially useful IP and the loss of a competitor in Paradigm/Icoria.
Also, Monsanto is probably in a far better position to make use of such patented technologies. In fact Monsanto has its own bulk sequence claims for maize and cotton.
As we will see in the following chapters, others have refined this patenting strategy in Arabidopsis.
Ceres is a privately-owned, US-based, plant biotechnology company (www.ceres.net) with headquaters in Thousand Oaks, California. It makes use of genomic technologies towards developments in agriculture and human health. Ceres specialises in high throughput screening, full length cDNA sequencing, plant breeding, and targeted gene activation. Interestingly, as with Paradigm and Mendel, Ceres has ties to Monsanto, with whom it has extensive license-based agreements.
The company states clearly on its website that the development and protection of IP is an important goal. On the same site it claims that Ceres has filed applications on over 50,000 full length genes and 10,000 promoter sequences. Ceres' goal of sequencing full-length cDNA sequences compared to ESTs or genomic sequences appears to arise from the need to gain composition of matter claims in US patents (and it seeks at least one substantial utility per gene sequence for patenting purposes). Ceres' genomics facilities apparently allow it to be able to investigate gene function such that descriptions of multiple utilities for each gene of interest can be obtained.
We chose to analyse a some recent applications by Ceres covering both large numbers of functional Arabidopsis sequences and also promoter sequences. Ceres' patent applications provide a more recent example of a bulk sequence application for analysis, and for comparison to the Paradigm and Mendel applications. Analysis of one example promoter application is also useful since Ceres has interests in this technology.
Ceres' IP interests in Arabidopsis are focused towards claims on promoter sequences and full length genes with agricultural utility.
To date, Ceres has filed patent applications on over 50,000 full-length functionally annotated genes and over 10,000 promoter sequences
Intellectual Property
Ceres places a large emphasis on the generation and protection of intellectual property. Ceres' intellectual property strategy leverages three major competitive advantages: (1) the ability to discover and sequence full-length genes in a high-throughput manner, (2) an integrated suite of genomic technologies which enables Ceres to ascribe specific utilities to a very high proportion of the genes that it discovers, and (3) information management systems which enable the rapid and accurate filing and follow-up of thousands of patent applications.
Ceres practices a proprietary technology for capturing and sequencing full-length cDNAs. Unlike ESTs, these sequences cover the complete and accurate open reading frame of a gene giving the information necessary for a cell to manufacture the encoded protein. Genomic sequence can sometimes give equivalent information, but accurate predictions of gene structure from genomic sequence are only accurate in approximately 50% of the cases tested. The US Patent and Trademark Office has indicated that accurate full-length genes are necessary for the issuance of composition of matter claims that cover the production of the encoded protein. In addition, at least one substantial and practical utility must be described for a particular gene sequence. Ceres' integrated genomics engine enables the description of multiple utilities for any given gene. The combination of this utility information and the full-length gene sequences should enable Ceres to gain intellectual property protection for a large portfolio of important plant genes.
The generation and integration of such a large amount of data into accurate patent application submissions requires a sophisticated information management effort. Ceres has designed and incorporated information management technologies into its technology platforms to enable the timely and accurate generation of thousands of patent applications. To date, Ceres has filed patent applications on over 50,000 full-length functionally annotated genes and over 10,000 promoter sequences.
http://www.ceres-inc.com/about/intell.html
April 3, 2002
Monsanto and Ceres Announce Important Product Discovery and Development Collaboration
St. Louis, Missouri and Los Angeles, California -- Monsanto Company and Ceres, Inc. today announced an important product discovery and development collaboration focused on applying genomics technologies to provide improvements in certain agricultural crops.
"Our new alliance with Ceres provides us access to a proprietary knowledge base that will accelerate Monsanto's product pipeline. By marrying the complementary genomics capabilities of Ceres with the product development capabilities of Monsanto, we will be in a position to deliver additional value to farmers worldwide," said Hendrik Verfaillie, chief executive officer of Monsanto.
Walter De Logi, chief executive officer of Ceres, said, "Monsanto is an ideal partner for Ceres because it is a leader in providing biotechnology-based agricultural products to the world's farmers. Monsanto has the resources to give Ceres' technology the widest possible distribution. Furthermore, we share a vision of providing abundant food and a healthy environment around the world."
Under the collaboration, Monsanto will acquire rights to Ceres technologies in certain crops and applications in exchange for payments over several years. Ceres will receive additional payments subject to meeting specified objectives for developing additional related technology, as part of its continuing commitment to genomics-based product discovery. Monsanto will also fund a jointly implemented research program and has made a minority equity investment in Ceres. Expected payments to Ceres under the collaboration are $137 million over several years plus potential royalties.
As part of the collaboration, Monsanto and Ceres have agreed to make technologies accessible to farmers in developing countries, including non-profit humanitarian applications not served by commercial markets or research and development investments. This innovative feature of the collaboration reflects the commitment of both companies to bring the benefits of science and technology to society through public-private partnerships to improve subsistence crops, in addition to the commercial development of products around the world.
Ceres, Inc. is a privately held biotechnology company utilizing multiple integrated plant genomics technologies to develop innovative products. In addition to pursuing opportunities in the food, feed and fiber industries, Ceres' business strategy consists of leveraging its technology platform in the agrochemical, chemical and pharmaceutical industries.
Monsanto Company (NYSE: MON) is a leading provider of agricultural solutions to growers worldwide. Monsanto's employees provide top-quality, cost-effective and integrated approaches to help farmers improve their productivity and produce better quality foods. For more information on Monsanto, see: www.monsanto.com.
http://www.ceres-inc.com/PressRelease_News/04_03_02.html
Ceres Announces Genomics Milestone in Energy
Crop Enhancement Program (info)
12,000 full-length switchgrass genes sequenced and genetic variation
characterized
THOUSAND OAKS, CA – July 10, 2006
Contact: Ceres, Inc. Shirley Bell sbell@ceres-inc.com
www.oxbio.com/pressreleases/071006_Ceres.pdf
MONSANTO COLLABORATES WITH CERES
Monsanto Company and the privately owned biotechnology company, Ceres Inc, Los Angeles, have announced collaboration to apply genomics technologies to provide improvements in agricultural crops. "Our new alliance with Ceres provides us access to a proprietary knowledge base that will accelerate Monsanto's product pipeline. By marrying the complementary genomics capabilities of Ceres with the product development capabilities of Monsanto, we will be in a position to deliver additional value to farmers worldwide," said Hendrik Verfaillie, CEO of Monsanto.
Under the collaboration, Monsanto will acquire rights to technologies in certain crops and applications in exchange. Expected payments to Ceres through the deal will be $137 million over several years as well as possible royalties. Monsanto and Ceres have also agreed to make some technologies accessible to farmers in developing countries, including non-profit humanitarian applications.
Crop Protection Monthly - 30 April 2002, Issue No. 149
Published by: Market Scope Europe Ltd
Website: http://www.crop-protection-monthly.co.uk
Editor: Brian R. Hicks
E-mail: brianralphhicks@aol.com
Since the major players in patenting of Arabidopsis sequences are known, the company names could be used in searches. In this case "Ceres" was used as part of a search strategy.
The Ceres Inc. patents discussed in this section were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
18 05 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
Arabidopsis AND (Ceres in applicant) |
|
Results |
36 |
|
Comments |
Of the 36 results identified using these search terms, 2 results were identified as having bulk sequence claims based on the content of their titles, abstracts, and claims. The applicant named on these applications is CERES, INC., Thousand Oaks, CA, US. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
The following is an example of a recent patent application by Ceres towards a large group of Arabidopsis sequences. Whilst there are many-more-than 100 sequences in this application (2523 sequence IDs listed), approximately half of these are for the corresponding protein sequence, and some sequences are from corn, soya bean, wheat, brassica, and others. This patent document also provides an example of a more recent bulk sequence application, than those of Paradigm or Mendel. The focus of this application is towards Arabidopsis polypeptides and the sequences encoding them,...
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
||
|---|---|---|---|---|
|
Title - Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics and phenotypes
The claims are generally drawn to:
The 85% sequence identity at the peptide level is broad, opening the possibility of finding prior art (although this might require analysis tools that may not be available to examiners). The hybridization language used here also allows the possibility of very broad claim scope. Again opening up the possibility of finding prior ar. The language used, and the 40° C to 48° C below Tm stipulation make it extremely difficult to do a thorough analysis of prior art. However, tools that allow such analysis might very-well lead to prior art discovery. The patent text defines the hybridization temperatures (-40 to -48° C from Tm) of this claim to be equivalent to "low stringency" conditions!
One interpretation of this is that claims over the orthologs listed in FIG 1 are also being made, and sequences 85% identical to them. This again broadens the scope of the claim - and may be enough to read on prior art. Discovery of such prior art would require a thorough analysis of not only sequences that are 85% identical to the SEQ ID No., but also of the orthologs in the alignment in Fig1. Are examiners able to do this effectively, given the time and resources available to them? |
Original Applicant: CERES, INC., Thousand Oaks, CA (US) For information regarding this application and possible licensing, we would suggest contacting: Dr Peter Mascia |
||
|
Remarks |
The scope of claimed sequences for this Ceres application relies on "hybridization language", as does the Paradigm Genetics applications discussed previously. Such claims scope are difficult to quantify. In this case Ceres has provided a temperature range (compared to Tm) for their hybridisation conditions. Unfortunately, by comparing this range to the literature (and their own) definition of stringency, their definition constitutes "low stringency" conditions. Thus their claims to Arabidopsis sequences almost certainly will cover the same or similar sequences from many dicots. |
|||
|
Other patent documents of interest |
||||
|
Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics |
~65 Arabidopsis sequences |
The following is an example of granted patents for promoter sequences from Ceres Inc. Although the application shown here contains less than 100 sequences, they are included to demonstrate Ceres' interest in Arabidopsis promoters and the relationship between the claims in granted patents and patent applications. Additionally, Ceres has many applications and patents for promoters, which together might constitute claims equivalent to a bulk sequence application.
The following 816' application does not list Ceres as the applicant or correspondent! By performing google searches using the inventors names (shing kwok yu-ping) it was possible to infer that these inventors were employed by Ceres around the same tme that the patent application was made (the publication number 20060008816 is not present in the USPTO assignee database at the time of this analysis). Hence the inclusion of the application below:
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||
|---|---|---|---|---|---|
|
Title - Promoter, Promoter control elements, and combinations, and uses thereof
The claims are generally drawn to:
|
Original Applicant:
Ceres Inc. |
|||
|
Remarks |
It is interesting to note that using the USPTO assignee database to search for "CERES" as ASSIGNEE returns patent documents that have the University of California named as applicant and that were later assigned to Ceres. Hence using CERES in the search below is likely to miss granted patents now assigned to Ceres. Additionally there is no guarantee that patents now assigned to Ceres will have their assignments updated in the USPTOs databses! |
The patents for promoter sequences discussed in this section were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
08 09 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(Arabidopsis AND thaliana AND Ceres) AND (promoter in claims) AND (SEQ AND ID) |
|
Results |
Of the 34 results obtained from this search, many had no obvious links to Ceres. Confirming the identity of Ceres as the assignee was difficult since "Ceres" as a search term uncovered documents containing "Ceres" linked to materials used in experimentation (e.g. Ceres microarrays...). For this reason we chose to discuss only the single document above to demonstrate Ceres' interests in Arabidopsis promoters. Please take note that this document was chosen because it is part of a larger document family with 5 US and one PCT applications. It is likely that the above represents only a small part of the promoter sequences included in patent documents. |
|
Comments |
You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
US 2006/150283
|
Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||
|---|---|---|---|---|---|
US 2006/150283 A1
|
Title - Sequence-determined DNA fragments and corresponding polypeptides encoded thereby
|
Ceres, Inc. |
|||
|
Remarks |
We have no information that a corresponding patent application has been filed in any other country besides the USA. |
This patent application is directed to nucleotide sequences and their encoded polypeptides from soybean, wheat , maize and canola, as well as Arabidopsis thaliana. According to sequence information published in the USPTO's PSIPS database, 1032 sequences are included in this application.
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|
|---|---|---|---|
|
Title - Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics
|
Ceres, Inc. |
|
|
Remarks |
US Application with missing sequence identifiers
|
Publication No. |
Title, Independent Claims and Summary |
Assignee and licensing information |
|||||
|---|---|---|---|---|---|---|---|
|
Title - Nucleotide sequences and polypeptides encoded thereby for enhancing plant drought tolerance
|
Ceres, Inc. |
|||||
|
Title
- Nucleotide sequences and corresponding polypeptides conferring modulated plant size and biomass in plants
|
||||||
|
Remarks |
The related PCT publication is WO 2006/76099 A2. We have no information that a corresponding patent application has been filed in any other country besides the USA. |
Cere's stated interests in patenting of whole cDNA sequences and promoters is supported by the patent documents we found for Arabidopsis.
Our analysis of bulk sequence applications focused on their 724' A1 application. This document was chosen becuase it represents a more recent example of a bulk sequence application than the Paradigm or Mendel applications. Interestingly, there are a few differences between such applications in the past 6-or-so years. Differences between the Cere's and Paradigm applications include the change from EST sequence claims (whch are effectively partial coding sequences only), towards complete cDNA sequences. One significant difference between Ceres and Paradigm's applications is the fact that Ceres has analysed missexpression of the claimed peptides in plants and made claims based on sequences that are more likely to have utility.
Mendel, in the patent application describes previously, also investigated the function of claimed transcription factors by over expression or knock-down experiments. However, unlike Mendel, the Ceres' bulk sequence application appears to be at a much earlier stage of the patenting process. The patent family surrounding the Mendel application is far more extensive and complex than that of the present Ceres application. It is not unreasonable to suspect that Ceres patenting strategy for Arabidopsis has been guided by knowledge and experience of previous bulk-sequence claims, and perhaps by the USPTO's requirement for significant utility in sequence applications.
Some interesting points, underlined by the analysis of the promoter patent applications, were:
Considering the points above, it is a difficult and time-consuming task to actually identify the present assignee of a particular patent. This is particularly true if there is little benefit for the assignee in disclosing this information in a clear and public fashion. It is entirely possible that many Ceres patent documents may not be identified from the analysis conducted here, which used a combination of both a standard patent document search (using "Ceres" in FULLTEXT) and a USPTO assignments database search (using "CERES" in ASSIGNEE).
Dow Chemical Company is a multinational, US-based, corporation with headquarters in Michigan. Followed by DuPont, it is the largest chemical company in the world (based on market capitalisation). It has interests not only in chemicals, but also in agricultural products, plastics and specialised products and services.
Its wholly-owned subsidiary, Dow AgroSciences LLC, is based in Indianapolis, Indiana, USA. Dow AgroSciences has interests in crop protection and crop biotechnology. Both Dow and Dow AgroSciences are applicants on a number of applications involving Arabidopsis. Two of these applications that appear to deal with bulk sequence claims are discussed below.
Since the major players in patenting of Arabidopsis sequences are known, the company names could be used in searches. In this case "Dow" was used as part of a search strategy. Unfortunately using arabidopsis AND (Dow in applicant) AND ((SEQ AND ID) in claims) did not uncover any patent documents of interest for bulk sequnence claims. Hence the general search strategy was broadened to the more general search below:
The Dow Chemicals Co. patents discussed in this section were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
19 07 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
arabidopsis AND dow AND (seq AND ID) |
|
Results |
372 |
|
Comments |
These results, when sorted according to revelvance score, uncovered some interesting applications in the top 100 documents. In the top 100 documents we discovered 8 of interest. These included US and PCT applications, with titles beginning " NUCLEIC ACID COMPOSITIONS CONFERRING..." By further researching these applications the documents dicussed below were uncovered. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
The following are patent applications by Dow Chemical Co. These applications cover a large number of claimed sequences (7,554 SEQ ID NOs in the first below). However, these applications contain sequences from a number of different organisms. Thus, not all the sequences are specifically from Arabidopsis. Claims are targeted towards sequences that result in altered metabolic, visual, and disease tolerance characteristics (i.e. those that have possible utility). There are 177 Arabidopsis sequences claimed in the first application below, and the other plant sequences claimed (e.g. from Nicotiana benthamiana) may overlap with other dicot species (including some from Arabidopsis) - depending on the scope of claim language used.
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||
|---|---|---|---|---|---|
|
Title - Nucleic acid compositions conferring altered metabolic characteristics
The claims are generally drawn to:
Conditions of low stringency may allow many of these sequences to hybridise to a range of sequences from dicots, monocots, and fungi. Thus these are very broad claims, and unlikely to be granted in their present state. The properties and binding characteristics of the "Artificial" sequences would require further analysis to determine whether they overlap with Arabidopsis sequences...
As above...
This method claim is not well defined, and is very broad in scope. It is unlikely that the specifications and descriptions in the patent itself are enough to enable all aspects of these claims. Which metabolites? What isolation method? What metabolic characteristics? What bioinformatics system? What is a "suitable analytic methodology"? |
Original Applicant: |
|||
| Remarks |
The following organism sequences are present in the application: |
||||
|
There is a corresponding PCT application that deals with the same sequences. However, no other applications could be found at this time in other jurisdictions. |
|||||
Dow Agrosciences has a similar application for a group of 2,000 sequences:
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
||
|---|---|---|---|---|
|
Title - Nucleic acid compositions conferring altered visual phenotypes
The claims are generally drawn to: An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-2065 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency, wherein expression of said isolated nucleic acid in a plant results in an altered visual phenotype. An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-2065 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency for use in producing a plant with an altered visual phenotype. |
Original Applicant: |
||
| Remarks |
Many of the sequences in this application have no clear indication of which organism they are from. Many only have numerical identifiers, and it is difficult to tell which are from Arabidopsis. Additionally, we could not find a sequence file deposited with the USPTO (one must exist however), and so we could not easily identify Arabidopsis sequences using this method. We chose several OCRed sequences from the sequence list and discovered that they were not present in GenBank at NCBI. |
|||
|
There is a corresponding PCT application that deals with the same sequences. However, no other applications could be found at this time in other jurisdictions. |
||||
Dow and its subsidiary DowAgroscience have interests in biotechnology R&D focused towards:
The bulk sequence documents analysed above claim a mixture of sequences from a number of different plant species, both monocots and dicots. For Dow, sequences from Arabidopsis appear to be less important than claims on sequences from organisms that have more direct economic value such as maize, cotton, sorghum, alfalfa and soybeans, and some species that are both model systems and crops, such as tobacco or rice. Additionally the Dow applications studied also contain a large number of claims to fungal sequences. For example, Trichoderma harzianum (sequences claimed in the patent document discussed above) is a fungus that is used as a biocontrol measure to protect crop plants against fungal pathogens such as Fusarium sp. (a fungal pathogen of cotton, amongst others).
Dow and DowAgroscience's diverse interests are reflected in their bulk sequence applications. So although these applications contain some Arabidopsis sequences, Arabidopsis is less a focus of patenting activity for Dow than Nicotiana benthamiana, another popular model plant system. The interest in N. benthamiana may be due to a number of reasons, including:
Applications claiming sequences from dicot crops
Monsanto Company has patent applications claiming bulk sequences from dicot crops such as soybean and cotton. We include these patent applications here because many of the claimed sequences from these crops are orthologous to sequences from A. thaliana.
US 2007/83945
This patent application covers soybean sequences . It remained unpublished for 83 months instead of the normal 18 months.
|
Title, Independent Claims and Summary |
Assignee and licensing information |
|||||
|---|---|---|---|---|---|---|
|
Title - Nucleic acid molecules and other molecules associated with plants
|
Monsanto Co. |
||||
|
Remarks |
We have no information that a corresponding patent application has been filed in any other country besides the USA. |
US 2007/61916
This patent application covers cotton (Gossypium hirsutum) sequences.
|
Publication No. |
Title, Independent Claims and Summary |
Assignee and licensing information |
|||
|---|---|---|---|---|---|
|
Title -Nucleic acid molecules and other molecules associated with plants and uses thereof for plant improvement
|
Monsanto Co. |
|||
|
Remarks |
We have no information that a corresponding patent application has been filed in any other country besides the USA. |
US 2007/236419
This Monsanto patent application claims sequences from soybean, maize and rice, and it is related to a family of patent applications US 2004/31072, US 2004/34888, US 2004/214272, US 2004/216190 and US 2007/11783.
.
|
Publication No. |
Title, Independent Claims and Summary |
Assignee and licensing information |
|||
|---|---|---|---|---|---|
|
Title - Nucleic acid molecules and other molecules associated with plants and uses thereof for plant improvement
|
Monsanto Co. |
|||
|
Remarks |
We have no information that a corresponding patent application has been filed in any other country besides the USA. |
Patent application by CropDesign
|
Patent/ Publication No. |
Title, Independent Claims and Summary |
Assignee and licensing information |
|||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
Title - Identification of novel e2f target genes and use thereof
|
CropDesign N.V. |
|||||||||||
|
Remarks |
The related applications were also filed in Europe (EP 1551983 A2) and Australia (AU 2003298095). |
One of the characteristics of bulk sequence claims in plants is that they are almost always in the form of patent applications. The applicants strategy in claiming bulk sequences is often to establish an early priority date and to later obtain granted patents on individual genes or small groups of genes derived from the initial applications. Hence , to this point, we have considered only patent applicatons. Whilst these applications have the potential to influence those working in dicot biotechnology, they are not granted patents.
In this section we will look more closely at examples of patents granted on cliams to Arabidopsis Transcription Factors (TFs).
This group was chosen because it includes patents granted to Mendel Biotechnology Inc., who have made application claims for bulk sequences from Arabidopsis. Since the applications are to transcription factors from Arabidopsis, they may also cover homologous sequences from other dicots. Thus analysis of granted patents for TFs offers the opportunity to analyse an important group of genes and also the ultimate patent claims that may evolve from bulk sequence claims.
The patents for Arabidopsis Transcription Factors discussed in this section were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
30 06 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) AND (transcription AND factor) |
|
Results |
2,509 |
|
Comments |
Of the 2,509 results identified using these search terms, only the top 100 results (based on relevance score) were considered. The content of their titles, abstracts, and claims were used to identify granted patents on Arabidopsis Transcription Factors. This procedure was in no way exhaustive, and because of the large number of transcription factors in Arabidopsis and their possible value in modifying plant phenotypes it is very likely that many TF patents were not considered in this analysis. The TF patents discussed below are representative of this group of genes from Arabidopsis. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
A new search was performed to attempt to limit the number of useful identified patent documents:
|
Search details |
|
|---|---|
|
Date of search |
30 06 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) AND ((transcription AND factor) in claims) |
|
Results |
92 |
|
Comments |
Of the 92 results identified using these search terms, only 7 were granted patents of interest. The remainder were applications and/or were not for Arabidopsis genes. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
A new search was performed designed to identify granted patents from Mendel only:
|
Search details |
|
|---|---|
|
Date of search |
30 06 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) AND (transcription AND factor) AND (mendel AND biotechnology) |
|
Results |
40 |
|
Comments |
Of the 40 results identified using these search terms, only 4 were granted patents of interest. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
The following are some other examples of granted patents for Arabidopsis sequences. Note that there are many more granted patents for Arabidopsis, far more than we can analyse at present. We invite interested readers to suggest patented genes and sequences (or a particular area of interest) for further analysis.
The following is an example of a granted patent for an Arabidopsis transcription factor from Korea Kumho Petrochemical Co. This was chosen as a comparison to those granted transcription factor patents from Mendel Biotechnology:
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|
|---|---|---|---|
|
Title - Transcription factor gene induced by water deficit and abscisic acid isolated from Arabidopsis thaliana
The claims are generally drawn to:
The scope of this claim is limited to the nucleic acid for Athb-12. The
scope of claims are not defined according to similarity or hybridisation
language. No claims are specifically made towards organisms transformed with
this sequence or to methods to produce them. It might be argued that the the
doctrine of equivalents allows a wider scope to similar sequences. |
Original Applicant:
|
|
|
Remarks |
Note that this granted patent is an earlier patent than those of Mendel's.
It is uncertain whether the limited scope of the claims reflects prior existing
art (unlikely for such an early application?) or merely reflects early attempts
to patent gene sequences. |
A comparison of the granted claims from the 466' patent and the
claims of the parent application demonstrates one important difference between
applications and granted claims
The following is claim #1 from the provisional patent application US 60/125,814:
Note that the term "...substantially the same DNA binding
specificity,..." is difficult to quantify and that the claims are
towards sequences from conserved DNA binding domains of transcription factors.
Thus these claims are potentially very broad.
Compare the text above with the equivalent claim from the granted US patent
US6664446
that evolved from the 814' application:

Whereas, the application claims refers to a broad group of conserved DNA-binding domains from transcription factors, the granted patent refers specifically to polynucleotides that encode a single protein - SEQ ID NO.18. This protein is a full-length 268-amino-acid transcription factor from Arabidopsis. Hence this claim is very-much limited in scope compared to the application.
Even-so, it seems likely that this granted claim covers more than a single polnucleotide sequence.
This is due to the following:
Notice also that the 446' patent was filed in 2000, and granted in 2003 and claims benefit of the U.S. Provisional Application Serial No. 60/125,814 filed Mar. 23, 1999. There are at least 17-other patent applications that claim priority from the same provisional application. It is possible that other granted patents will arise from this family in the near future.
An important point demonstrated in this chapter, by comparison with previous chapters, is the difference between bulk sequence applications and the patents that are likely to evolve from them. Granted patents that provide exclusionary rights based on sequences from Arabidopsis are often very-much restricted in claims scope than their parent applications. So, while the claims made in bulk sequence applications by companies such as Paradigm and Mendel may be broad, descendent patents appear to be reasonably limited in scope.
In one sense this is an entirely obvious, and not unreasonable observation. Patent applicants often attempt in the first instance to claim the maximum range possible, and these claims end up being moderated and limited by rounds of revisions required by patent examiners. This is a common feature of patent applications and is not limited to applications containing sequences. In fact such a strategy is almost enshrined, if not promoted, by many patenting systems.
As stated in a previous chapter, granted patents are not necessarily the problem (although some may argue with this!). Instead, it is the uncertainty generated by the ability of large companies to make such bulk sequence applications that may cause problems for those working with Arabidopsis (and flowering plants). The process of setting early priority dates with bulk applications and then keeping these applications pending through chains of continuations, continuations-in-part, and divisional applications, results in the following:
Such uncertainty may not be at the forefront of much not-for-profit research, it will however be pivotal in decisions made by private R&D companies, where FTO is critical. With the growing need for collaboration between not-for-profits, Universities, and private industry, such uncertainty is a growing concern for university researchers and not-for-profits.
This uncertainty has the potential to prevent innovation - a result which would be in conflict with the intent of the patent system.
As mentioned in previous chapters, one of the problems with bulk sequence claims (not just for researchers but also for patent examiners) is the fact that these applications often try to claim homologous sequences in many other organisms. This is done typically by the use of such claim language as:
|
|
(Note that this is a claim from the Dow Chemical Co. application we discussed in Chapter 6, which contains many Nicotiana benthamiana and "Artificial" sequences )
Although this is not the only way of claiming many related sequences. It also has the added "advantage" (if you are the applicant!) of being difficult to easily quantify, and hence offers the opportunity of obtaining the broadest possible claim scope. Those "skilled in the art" will realise that many (possibly only distantly related!) nucleic acid sequences will hybridise together under low stringency conditions. Further, the term ".., wherein expression of said isolated nucleic acid in a plant results in an altered metabolic characteristic." provides a way of claiming such nucleic acid sequences with utility.
The probability that an applicant will achieve such claims to all 7554 sequences is small. However, it is likely (and no doubt the applicant expects) that some of these sequences will be claimed in successful future applications. The uncertainty with this claim is two-fold:
At this point we see the connection between such broad bulk-sequence claims in other dicoots (Nicotiana benthamiana claim above) and Arabidopsis. It is very likely that the bulk of the genes/DNA sequences in claim 1 above will hybridise under low stringency conditions with the corresponding Arabidopsis DNA sequences. Hence bulk sequence claims for other dicots influence Arabidopsis in the same way that the bulk sequence claims of Paradigm Genetics Inc. would read on sequences from Nicotiana benthamiana!
|
Bulk claims for Arabidopsis
thaliana and Nicotiana benthamiana sequences, may not only
interfere with each other, |
|---|
The search strategy we used in this case was different from those used previously, as it made use of the term "seqdata". Sequence listings of considerable size (greater than 300 pages, i.e. those likely to represent larger bulk-sequence applications) are now available for separate download from the USPTOs website (PSIPS). Most recent, but perhaps not all, applications and patents with large sequence listings can thus be identified with the serach term "seqdata". A term that forms part of the referring URL found at the end of such patent documents. Hence this term offers a way of limiting searches to recent bulk-sequence documents. Another option available is to go to the USPTO's PSIP site and search the listings via publication date. This latter option is difficult since there are >1000 patent documents to browse, which contain large sequence listings.
One of the patents discussed in this chapter were identified using the following search strategy.
|
Search details |
|
|---|---|
|
Date of search |
21 07 2006 |
|
Database searched |
Patent Lens, accessible at www.patentlens.net |
|
Type of search |
Structured Search |
|
Collections searched |
US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in Collection" which will take you to a web page that describes the extent of the collection.) |
|
Search terms |
seqdata AND plant |
|
Results |
328 |
|
Comments |
From the 328 results identified using these search terms, we chose several that delt with sequences from dicots. These were used for comparisons with Arabidopsis patent documents and sequences. You may find other patents or applications of interest based on your particular circumstances. We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search. We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link. |
The following patent documents demonstrate attempts to patent large groups of sequences from other dicots. These applications use claim language such that a successful claim would almost certainly cover large numbers of sequences from other dicots, and perhaps also some from Arabidopsis. It is therefore useful to investigate briefly the claim contents of such applications - particularly since many of these applications are more recent than those of Paradigm Genetics, and, unlike the applications of Paradigm, appear to be in the process of active patenting efforts by the respective companies.
The claims of the Agrigenesis patent below refer to sequences derived from Eucalyptus grandis and Pinus radiata:
|
Patent or Publication No. |
Title, Independent Claims and Summary |
Applicant |
|||||
|---|---|---|---|---|---|---|---|
|
Title - Compositions isolated from plant cells and their use in the modification of plant cell signaling
The claims are generally drawn to:
The sequences above were derived from:
|
Original Applicant: |
|||||
[Summary for overlapping claims from one organism to another. Leading into and connection with genome mapping of patents in Chapter 10]
In Chapter 8 we expanded upon the idea that claims originating on sequences from one organism, for example Arabidopsis, may also claim similar sequences from other organisms. In particular claims to sequences from a model organism (that may have little economic importance) may include hybridisation and/or similarity language that may effectively claim sequences from economically important crops. So companies that seek to claim large numbers of genes from Arabidopsis are effectively also attempting to claim similar sequences from important crop plants.
The degree to which such claims may read on other organisms is very difficult to calculate due to:
For each of the points above, we have endevoured to develop a solution, an improvement, or an approximation, that gives a better idea of the state of inter-genome claims.
The plot below shows the total number of Arabidopsis thaliana patent sequence filings (blue bars) as well as the number of applications claiming sequences that match a segment of the the Arabidopsis thaliana genome at least 150bp long (green bars) from 1994 to 2006. Because published US application data was not available prior to 2001, the low number of claimed sequences and sequence filings from 1994 to 2001 may be largely attributed to a lack of data from the USPTO rather than a lack of filings.
This plot shows the average number of sequences referenced in claims since 2001.
US Patent Applications on Arabidopsis thaliana
Granted US Patents on Arabidopsis thaliana |
For the plots below, the Arabidopsis genome was divided into 300 kbp segments. For patent applications that claim Arabidopsis gene sequences, the plots below show the number of sequences that match at least a 150 bp fragment of each 300 kbp genome segment.
In addition to initial patent filings, continuation and divisional applications are included in the count, so the same sequence may be counted multiple times. Each plot shows data for a single Arabidopsis chromosome.
See the corresponding plots for the rice genome landscape.
The Arabidopsis genomic data contains some vector contamination for chomosomes 2 and 3 resulting in some spikes that go off the scale of the chart.
US Patent Applications on Arabidopsis thaliana
Granted US Patents on Arabidopsis thaliana |
For the plots below, the Arabidopsis genome was divided into 300 kbp segments. The plots below show the number of US patent applications that claim sequences that have at least a 150 bp match to a 300 kbp genome segment.
In addition to initial patent filings, continuation and divisional applications are included in the count, so the same sequence may be counted multiple times. Each plot shows data for a single Arabidopsis chromosome.
See the corresponding plots for the rice genome landscape.
US Patent Applications on Arabidopsis thaliana
Granted US Patents on Arabidopsis thaliana |
For the plots below, the Arabidopsis genome was divided into 300 kbp segments. For each 300 kbp segment, the percent coverage by claims in US patent applications was calculated. For example, if an entire 300 kbp segment was claimed by US patent applications, the plot would read 100% for that segment. The Y axes show the percent coverage, and the X-axes show the region (locus) of the Arabidopsis genome. Each plot shows data for a single Arabidopsis chromosome.
See the corresponding plots for the rice genome landscape.
US Patent Applications on Arabidopsis thalianaGranted US Patents on Arabidopsis thaliana
|
For the plots below, the Arabidopsis genome was divided into 300 kbp segments. For granted patents that claim Arabidopsis gene sequences, the plots below show the number of sequences that match at least a 150 bp fragment of each 300 kbp genome segment.
In addition to initial patent filings, continuation and divisional applications are included in the count, so in some cases, the same sequence is counted multiple times. Each plot shows data for a single Arabidopsis chromosome.
See the corresponding plots for the rice genome landscape.
US Patent Applications on Arabidopsis thalianaGranted US Patents on Arabidopsis thaliana
|
For the plots below, the Arabidopsis genome was divided into 300 kbp segments. For each 300 kbp segment, the percent coverage by claims in granted US patents was calculated. For example, if an entire 300 kbp segment was claimed by US patents, the plot would read 100% for that segment. The Y axes show the percent coverage, and the X-axes show the region (locus) of the Arabidopsis genome. Each plot shows data for a single Arabidopsis chromosome.
See the corresponding plots for the rice genome landscape.
US Patent Applications on Arabidopsis thalianaGranted US Patents on Arabidopsis thaliana
|
For the plots below, the Arabidopsis genome was divided into 300 kbp segments. The plots below show the number of granted US patents that claim sequences that have at least a 150 bp match to a 300 kbp genome segment.
In addition to initial patent filings, continuation and divisional applications are included in the count, so the same sequence may be counted multiple times. Each plot shows data for a single Arabidopsis chromosome.
See the corresponding plots for the rice genome landscape.
A number of companies have claimed Arabidopsis genes sequences in U.S. patent applications and issued patents. Here we present graphs showing the percentage of each chromosome that is claimed in patent applications and patents assigned to seven different companies.
For the plots below, the percent coverage for each rice chromosome is shown. For example, if the entire sequence of a particular Arabidopsis chromosome were claimed in patents or applications, the plot would read 100% for that chromosome. The Y axes show the percent coverage, and the X-axes show the chromosome of the Arabidopsis genome. Note: the scales of the Y axes have been adjusted to reflect the data for each company.
Applications
Granted Patents
Applications
Granted Patents
Applications
Applications
![]()
Applications
Granted Patents
![]()
![]()
Applications
Applications
Granted Patents
![]()
![]()
Applications
Granted Patents
![]()
![]()
Applications
Granted Patents
![]()
![]()
Applications
Granted Patents
![]()
![]()
The first steps in understanding the extent to which claims from one organism overlap into the genome of another is to develop the databases, software, and methodology to answer the question. The next step we chose was to perform the most straightforward analysis, both to test the methodology and to obtain a "baseline" for comparisons of future analyses. We set out to:
These initial steps were used to attempt to reproduce aspects of the analysis performed by Jensen and Murray (Science. 2005 Oct 14;310(5746):239-40.) on the human genome map for Arabidopsis. Essentially, claimed sequences from US-granted patents were megaBLASTed against the Arabidopsis genome sequences, and sequences of interest (longer than 28bp, with expect values of <1 x 10e-200 and highest bit scores) were identified. The genome positions of each of these sequences of interest were identified and grouped together into regions. Each region spans a 300kb region of an Arabidopsis chromosome. The number of patents in each region was then mapped to the region position on the respective chromosome.
The end result of this analysis was a dataset of patents containing Arabidopsis sequences in their claims, linked to the specific map location of those sequences on the Arabidopsis genome. The sequences and matched genomic regions are made using relatively highly specific search criteria. Resulting in "exact" matching of claimed sequences to their corresponding genomic sequence. No attempt was made to identify homologous sequences within the genome, which may also be claimed by the patent document. Hence the maps obtained are likely to under-represent the actual sequence claims in Arabidopsis, from patents claiming Arabidopsis sequences.
|
Note also: This analysis does not deal with those sequences that are present in patent applications, and most importantly will not show sequences appearing in the claims section of bulk sequence applications! |
|---|
|
|
By performing a modification of the analysis of Jensen and Murray (Science. 2005 Oct 14;310(5746):239-40.), a member of our group was able to map patented sequences from US granted patents onto the Arabidopsis chromosomes. The following 5 maps demonstrate the distribution of patents across Arabidopsis chromosomes. Note carefully the limitations of this analysis:
(see below for more details of these analyses) (Chromosome 1 = 30Mb) |
|
|
In all chromsome maps the Y-axis is the number of patents found within 300Kb sections of the chromosome map. Chromosome 2 |
![]() |
Chromosome 3: Note:
|
![]() |
Chromosome 4 Note: Sequences contained in US applications were not included in any of these chromosome maps. All of the bulk sequence applications discussed in the text below are not included in this map. |
![]() |
Chromosome 5 The same Y-axis scale has been used throughout, to allow the user to compare the relative "intensities" of patenting activity on each of the chromosomes. Different X-axis scales have been used, to allow optimal resolution of patent activity. The relative sizes of the different chromosomes are represented in the chromosome graphic below the X-axis. |
|
Note: Because of the nature of the analysis above (see the notes on the right of the maps above), the mapped sequences are likely to greatly under-represent the true degree of patent activity on the Arabidopsis genome! |
|---|
During the mapping of patent documents to the corresponding regions of the Arabidopsis map it was noticed that there are a number of places on the map that have a higher-than-average patenting activity.
These regions included:
| Arabidopsis
Chromosome |
Map Position |
Number of Patents |
Number of Sequences |
|---|---|---|---|
|
2 |
3.6 |
6 |
7 |
|
3 |
3.6 |
10 |
14 |
|
3 |
13.5 |
7 |
78 |
|
4 |
12.9 |
8 |
10 |
From this table further analyses revealed the following:
Arabidopsis Chromosome 3 and the 47-patent hot spot at Map position 13.5Mb:
Analysis of the sequences from these 47 patents (by BLAST at NCBI) demonstrated that a ~450bp fragment from a cloning vector contaminates the chromosome sequence at this position. The large number of patent documents matching at this site are a result of matches with this region from patent sequences unrelated to Arabidopsis sequences. Our method is sensitive to such "contamination" since it relies on almost identical matches with the chromosomal sequences of Arabidopsis to identify patented sequences (and hence patents of interest from Arabidopsis). It is interesting that this sequence contamination has persisted in the Arabidopsis genome sequence and suggests that the methods used to screen for vector contamination are not entirely reliable.
Arabidopsis Chromosome 3: The 10-patent-hot-spot at Map position 3.6Mb:
US patents appearing at this hotspot include those listed in the following table:
|
Patent |
Issue Date |
Title |
Status |
Applicant |
|---|---|---|---|---|
|
06-15-1999 |
Arabidopsis spindly gene, methods of identification and use |
Expired |
Regents of the University of Minnesota |
|
|
10-12-1999 |
Strong early seed-specific gene regulatory region |
In force |
Monsanto Company, Inc |
|
|
09-18-2001 |
Production of hydroxylated fatty acids in genetically modified plants
|
In force |
Carnegie Institution of Washington, Monsanto Company Inc. |
|
|
10-30-2001 |
Plant fatty acid hydroxylases |
In force |
Carnegie Institution of Washington, Monsanto Company Inc. |
|
|
01-29-2002 |
Fatty acid desaturases and mutant sequences thereof |
In force |
Cargill, Inc |
|
|
04-16-2002 |
Genes for microsomal delta-12 fatty acid desaturases and hydroxylases
from plants |
In force |
E.I. du Pont de Nemours and Co |
|
|
11-04-2003 |
DNA encoding for plant digalactosyldiacylglycerol
galactosyltransferase and methods of use |
In force |
Board of Trustees operating Michigan State University |
|
|
03-29-2005 |
Genes for microsomal delta-12 fatty acid desaturases and related
enzymes from plants |
In force |
E. I. du Pont de Nemours and Co |
|
|
07-19-2005 |
Genes for microsomal delta-12 fatty acid desaturases and related
enzymes from plants |
In force |
E. I. du Pont de Nemours and Co |
|
|
11-22-2005 |
Fatty acid desaturases and mutant sequences thereof |
In force |
Cargill, Inc |
Essentially this hotspot refers to (amongst others) claims against fatty acid desaturase (FAD) genes from Arabidopsis and related organisms such as Brassica spp. Some of the sequences mapping to chromosome 3 of Arabidopsis are in fact the FAD gene from rape (Brassica napus). The importance of FAD claims is underlined by the presence of large private companies involved in agritech research and development: Monsanto, Du Pont, and Cargill. Manipulation of FAD levels in plants is one way in which the ratio of saturated to unsaturated oils might be controlled. Such manipulation is of interest due to the human health implications and industrial applications (including biodiesel) possible through the generation of crop plants with modified levels of unsaturated oils. The patents listed above are an example of a small patent thicket that has arisen around an industrially-important gene family (in this case FAD from Arabidopsis and Brassica sp.). Although beyond the scope of the present landscape: It is interesting to speculate that the FAD genes from other species (e.g. Gossypium hirsutum) may also be the focus of patenting activity.
Arabidopsis Chromosome 2 and the 6-patent hot spot at Map position 3.6Mb:
US patents appearing at this hotspot include those listed in the following table:
|
Patent |
Issue Date |
Title |
Status |
Applicant |
|---|---|---|---|---|
|
11-13-2001 |
Chimeric somatostatin containing protein and encoding DNA, plasmids
|
Expired |
Tikhonenko; T. |
|
|
04-16-2002 |
Process and materials for production of glucosamine |
In force |
Arkion Life Sciences LLC |
|
|
03-25-2003 |
Human porphobilinogen deaminase sequences |
In force |
Hemebiotech A/S |
|
|
12-23-2003 |
Method and phage for the identification of nucleic acid sequences encoding members of a multimeric (poly) peptide complex |
In force |
Morphosys AG |
|
|
08-23-2005 |
Methods and means for producing efficient silencing construct using recombinational cloning |
In force |
Commonwealth Scientific and Industrial Research Corporation; |
|
|
12-06-2005 |
Plant chromosome compositions and methods |
In force |
The University of Chicago |
Again, most of the patenst above do not deal with direct claims to sequences from Arabidopsis or other plants. Instead there appears to be some vector contamination in the chr2 sequence from Arabidopsis, and this appears to account for much of the "hotspot" activity here.
Arabidopsis Chromosome 4 and the 8-patent hot spot at Map position 12.9Mb:
US patents appearing at this hotspot include those listed in the following table:
|
Patent |
Issue Date |
Title |
Status |
Applicant |
|---|---|---|---|---|
|
01-05-1999 |
Oil bodies and associated proteins as affinity matrices |
In force |
Sembiosys Genetics Inc |
|
|
04-06-1999 |
Method for regulating cold and dehydration regulatory genes in a
plant |
In force |
Michigan State University |
|
|
04-06-1999 |
DNA and encoded protein which regulates cold and dehydration
regulated genes |
In force |
Michigan State University |
|
|
03-06-2001 |
GA 20-oxidase gene sequences |
In force |
Long Ashton Research Station |
|
|
07-09-2002 |
Plant having altered environmental stress tolerance |
In force |
Michigan State University/Mendel Biotechnology Inc. |
|
|
12-17-2002 |
Genes encoding plant transcription factors |
In force |
Independent Administrative Institute Japan International |
|
|
12-30-2003 |
Environmental stress-tolerant plants |
In force |
Independent Administrative Institute, Japan International |
|
|
03-16-2004 |
Plant having altered environmental stress tolerance |
In force |
Michigan State University |
(Note that the Applicant in the above tables was that listed in the PatentLens database entry for the given patent. Where no company name appeared (only inventors names) a further USPTO search was conducted to identify assignee information. Note: These assignments may not be accurate after the date of issue.)
The plot below shows the the total number of U.S. patent applications containing sequence listings from 2001 through the second quarter of 2005. The plot was constructed with data from applications published since 15 March 2001, when applications were first made available in the XML format. The data do not extend to the current date due to the 18 month lag between when an application is filed and when the application generally publishes.
The graph below shows the average number of sequences referenced in claims over the same time period.
As we saw in the previous analysis of claimed sequences appearing in granted US patents (US-B patent documents), it is possible to obtain an overview of patent activity in Arabidopsis by:
From this we were able to detect patenting "hot-spots" and to identify various Arabidopsis genes of interest to those involved in exploiting related genes.
One limitation of this analysis is the fact that much of the Arabidopsis sequences appear in patent applications, not in granted patents. To overcome this limitation a further query database was constructed, consisting of sequences appearing in the claims section of US patent applications (US-A patent documents). Included in this dataset were sequences from bulk sequence applications for Arabidopsis.
(Note: The construction of this database required considerable effort. The database itself can be found here.)
We repeated the analysis performed above on this new dataset, to determine the extent of patent pending claims on the Arabidopsis genome.
|
This analysis does not deal with those sequences that are claimed through
similarity or hybridisation language. |
|---|
|
|
By performing a modification of the analysis of Jensen and Murray (Science. 2005 Oct 14;310(5746):239-40.), a member of our group was able to map sequences appearing in the claims section of US patent applications onto the Arabidopsis chromosomes. The following 5 maps demonstrate the distribution of sequences across Arabidopsis chromosomes. Note carefully the limitations of this analysis:
(see below for more details of these analyses) (Chromosome 1 = 30Mb) |
|
|
In all chromosome maps the Y-axis is the number of patent applications found within 300Kb sections of the chromosome map.
Chromosome 2 |
|
|
Chromosome 3: Note:
|
|
|
Chromosome 4 Note: Sequences contained in US granted patents were not included in these chromosome maps. Although it is unlikely that the inclusion of such data would change the map significantly. As far as possible, bulk sequence applications have been included in this analysis. |
![]() |
Chromosome 5 Different X-, and Y-axis scales has been used throughout, to allow the best possible view of all data. |
|
Note: Because of the nature of the analysis above (see the notes on the left of the maps above), the mapped sequences are likely to greatly under-represent the true degree of patent activity on the Arabidopsis genome! |
|---|
In this genome landscape, we have determined the extent to which the Arabidopsis genome sequences are claimed in both issued U.S. patents and U.S. patent applications. Our initial approach was to reproduce aspects of the analysis performed by Jensen and Murray (Science 310:239-40, 2005) in their analysis of the human genome. Our process entailed a number of informatics steps that are outlined below.
In summary, we compiled a database of patent sequences that are claimed in patent applications, and compared these sequences to the published Arabidopsis genome using a BLAST-type interface. We then determined which portions of the claimed sequences had significant homology to sequences in the Arabidopsis genome, and mapped these sequences to their location on the Arabidopsis chromosome. The results of our analysis are shown in the subsequent pages of this landscape.
We used the most recent Arabidopsis genome sequences from the Arabidopsis Genome Project at NCBI. We then used the formatdb program from NCBI to convert the data to a searchable BLAST database.
Applications
For patent applications, we acquired the sequences of the bulk sequence applications from the Publication Site for Issued and Published Sequences (PSIPS) web site. This web site provides sequence listings for U.S. patents and applications that are longer than 300 pages. We also acquired the sequence listings for the non-bulk sequence listings (fewer than 300 pages in length) that are published by the USPTO as an XML document. For each of the listing types (bulk sequence and non-bulk sequence), there was a separate file for nucleotides and amino acids. Data for U.S. applications have only been available since 2001.
The bulk and non-bulk sequence listings were then converted to a common data format (FASTA) and combined to create one database for nucleotide sequences, and one database for amino acid sequences. Additionally, each of these combined databases was converted to a searchable BLAST database for use with CAMBIA's patent sequence search tool.
Granted (Issued) Patents
For granted U.S. Patents, we had a data source that wasn't available for the applications; GenBank at NCBI has a searchable patent database of sequences disclosed in granted patents. To create our granted patents sequence database, we started by acquiring the U.S. patent sequences from GenBank. This required removing all sequences that originated from non-U.S. patents.
We then acquired the sequence listings from the bulk and non-bulk patents in the manner described above in the Applications section. The data from all three sources (GenBank, bulk, and non-bulk) were converted to a common format. We then carried out a filtering step that removed any duplicate sequences in the data provided by GenBank, and the sequences provided by the USPTO (bulk and non-bulk).
The identical process was carried out for nucleotide sequences and amino acid sequences. As with the applications, each of these combined databases was converted to a searchable BLAST database for use with CAMBIA's patent sequence search tool.
A key feature of our analysis is that we determined which sequences were actually claimed in patents and applications, rather than just disclosed in the specification. To this end, we created four databases that contain only the sequences that are claimed in patent applications. The four databases created correspond to nucleotide sequences in applications, amino acid sequences in applications, nucleotide sequences in granted patents, and amino acid sequences in granted patents.
We compiled a list of common phrases that are used to identify sequence listings in claims. This step was tricky, as there are many different phrases that patent applicants use to designate sequence listings in claims (see examples of phrases) . Using these phrases, we created a list of sequence ID numbers that are designated in patent claims. We then created four new databases that contain only the sequences that are claimed in applications and patents.
After compiling a collection of sequences that are claimed in patents and applications, we then used those sequences to query the Arabidopsis genome database (see step 1) using mega BLAST to identify claimed sequences that have significant homology to sequences in the Arabidopsis genome. We performed this analysis only with the claimed nucleotide sequences in patents and applications.
The sequences that had significant homology to sequences in the Arabidopsis genome were plotted three different ways. The criteria for matches in the database was that they have a BLAST E value less than e-200.
- Sequence Count. For these plots, the Arabidopsis genome was divided into 300 kbp segments. For patent applications that claim Arabidopsis gene sequences, we plotted the number of sequences that match at least a 150 bp fragment of each 300 kbp genome segment. For each sequence, only the highest-scoring genome match was counted.
- Patent Count. As with the Sequence Count plots, the Arabidopsis genome was divided into 300 kbp segments. For patent applications that claim Arabidopsis gene sequences, we plotted the number patent applications that claim a 150 base pair or longer fragment of each 300 kbp genome segment. For each sequence, only the highest-scoring genome match was counted.
- Percent Genome Coverage. For these plots, we plotted the percentage of each genome segment that was claimed in patent applications. Unlike the previous two plot types, there was no requirement that the fragments that matched the Arabidopsis genome sequences be a minimum length (e.g., 150 base pairs). However, the minimum size for a positive match using the BLAST interface is 26 base pairs, so matches shorter than 26 were not included in the analysis. In addition, if multiple sequences covered the same portion of the genome, that portion of the genome was counted as being covered only once.
The end result of our analysis was a dataset of patents containing Arabidopsis sequences in their claims, linked to the specific map location of those sequences on the Arabidopsis genome. The sequences and matched genomic regions are made using relatively highly specific search criteria. Resulting in "exact" matching of claimed sequences to their corresponding genomic sequence. No attempt was made to identify homologous sequences within the genome, which may also be claimed by the patent document. Hence the maps obtained are likely to under-represent the actual sequence claims in Arabidopsis, from patents claiming Arabidopsis sequences.
During the patent application process the applicant generally attempts to achieve the widest possible claims. Often this is facilitated by the use of similarity or hybridisation language such as:
[INSERT A FIGURE EXTRACTED FROM A PATENT APPLICATION - ONE CLAIMING ~60% IDENTICAL SEQUENCES]
The mapping analyses performed above only detect sequences that are ~100% identical to the claimed sequences, and miss altogether those that are less-than ~100% identical. This fact, and the existence of such claim language, implies that there are many-other sequence claims that do not appear on our maps. The maps above are therefore under-estimates of the true patenting activity against the Arabidopsis genome.
|
Remember that the converse is also true: claims based on Arabidopsis
genes, which use similarity and hybridisation language, |
|---|
Many of the sequences mapped to the chromosomes above will have homologous sequences elsewhere in the genome. Similarly, sequences from other related-organisms (e.g. Brassica napus, canola) that are <100% identical to the corresponding Arabidopsis gene are probably not present on the map (although some that are almost 100% identical are there already).
The use of hybridisation language also contributes to these cross-genome claims. For example, the following claim would almost certainly claim similar sequences from many other genomes:
|
|
(Patent Application US20040249146: Nucleic acid compositions conferring altered visual phenotypes, DOW AGROSCIENCES)
Due to the complexity and variation in the claim language it is not possible to accomplish an exhaustive claim-by-claim analysis for all patents. For this reason we have chosen what we believe to be a reasonable computational approximation of the problem. The choices made with respect to this new method of analysis were:
|
1. Sequence Database Used as the Query |
The query sequences chosen were those DNA sequences that appear in the claims of both US patent applications and granted patents. |
|---|---|
|
2. Sequence Database Used as the Target |
The target sequences for this analysis were the DNA sequences from all 5 Arabidopsis chromsomes. |
|
3. Initial Sequence Comparison |
Instead of megaBlast analysis, a method optimised for the fast discovery of almost identical sequences, we chose the Blastn algorithm for this analysis. Blastn is suited to the recognition of more divergent sequences than megaBlast and we beleive that it is more appropriate for this analysis. |
|
4. Filters and Variables Used For Blastn Analysis |
To obtain the largest and most comprehensive dataset for analysis in subsequent steps, we set the Expect Cutoff value to be 1 x 10-20 for Blastn (compared to 1 x 10-200 for the previous analyses). Furthermore, we retained only those sequence alignments that were >= 150bases in length for analysis (this is the same length used in the previous analyses). This database forms the core for future analyses based on similarity filtering, and can thus be re-anaylsed using various degrees of similarity (see Step 5 below). |
|
5. Similarity Value |
At this point we chose what we beleive to be a useful and reasonable definition of patent claims to "similar" sequences. Thus, 85% identity was chosen as a reasonable value to use in the similarity filter for this analysis. This is based on our experience with patent documents claiming sequences based on sequence identity. Technically, it is possible to re-filter our dataset using higher or lower valuse of % identity. |
|
6. Discovery of Arabidopsis Sequences with >=85% Identity to the Query Sequence |
The filter was applied and alignments with >=85% similarity were identified. These identified sequences were then aligned over their full length with the corresponding Arabidopsis sequence. Only those with a full-length alignment score of >= 85% identity were retained. |
|
7. Mapping to the Arabidopsis Genome |
Mapping was carried out as in the previous analysis, with matching sequences grouped into 300kb regions and then the totals for these regions mapped as a histogram onto the chromosomes. |
Soya Bean (Soybean/Soya/Soy/Glycine max):
Soybean, or Glycine max, has a long history of domestication as a food source, and the seed has been harvested for food for many hundreds of years in Eastern Asia. This long history of domestication has resulted in the existence of a very large number of soybean variaties anbd cultivars. More recently, soybean has become an important global food crop, grown in Europe, Asia, Australia, Africa, and Northern and Southern America. The plant itself is a legume, and will grow in many soil types, requireing a hot summer growing season. Being a legume, the plant establishes a symbiotic relationship with nitrogen-fixing bacteria and is able to grow in relatively nitrogen-poor soils.
Soybean is rich in both protein and oil. Present day crops are grown for use mainly in processed foods, with only a relatively minor proportion being comsumed directly as food. Soybean and it's processed components finds its way into products such as breakfast cereals, oils, milk replacements, infant formulas, biodeisel, cosmetics, and animal feed.
Soybean and Biotechnology
Due to its economic importance and growing popularity in processed foods, G. max has been the focus of much research, including breeding studies and the production of GMOs (Genetically Modified Organisms). As one important example of this: Monsanto markets a Roundup-ready form of G. max which is resistant to glyphosate-based herbicides (e.g Monsanto's Roundup herbicides). This modification offers a simple way for farmers to plant seed directly into unploughed fields. Spraying with glyphosate-based herbicides eliminates competing weeds without the need for field ploughing prior to planting.
Patenting and genomics
The soybean genome is-as-yet in the early stages prior to genome
sequencing. The Purdue
University consortium has started early work to develop the base for
sequencing the genome of G. max. Hence this genome is only in the
earliest stages of sequencing. However, there are presently 463,101 nucleotide
and 2,615 protein sequences deposited in GenBank at the NCBI (as of 24th July
2006) for Glycine max. Although this dataset is far less than that
expected for the entire genome, it is possible to use the available sequence
data to predict the amount of similarity between Glycine max and
Arabidopsis thaliana genes and to determine the degree of cross genome
claims in patents.
Glycine max has a genome size of 1100Mb, almost 10-times larger than Arabidopsis thaliana, with a diploid chromosome count of 40 (Genetics. 2005 Jul;170(3):1221-30.). It belongs to the pea family, Fabaceae, along with peas, beans, and other legumes. We chose soybean as the Arbidopsis comparison for this analysis based on the following:
Goal: Demonstrate to users through bioinformatics analysis and graphical presentation of data:
|
Date |
DB and settings |
Search Terms |
Patent |
Notes |
|---|---|---|---|---|
|
20060602 |
PLens Stem-on Def DBs |
28,715 |
Too many |
|
|
20060602 |
PLens Stem-on Def DBs |
2,655 |
too many |
|
|
20060602 |
PLens Stem-on Def DBs |
4,945 |
tm |
|
|
20060602 |
PLens Stem-on Def DBs |
143 |
||
|
20060602 |
PLens Stem-on Def DBs |
132 |
11 contain thale AND cress but not Arabidopsis!!! |
|
|
20060602 |
PLens Stem-on Def DBs |
thale |
916 |
|
|
20060602 |
PLens Stem-on Def DBs |
cress |
2,149 |
|
|
20060602 |
PLens Stem-on Def DBs |
192 |
||
|
20060602 |
PLens Stem-on Def DBs |
160 |
32 contain mouse and ear and cress do not have Arabidopsis!!! |
|
|
20060602 |
PLens Stem-on Def DBs |
arbidopsis OR arabdopsis OR arabidpsis |
135 |
common spelling mistakes? |
|
20060602 |
PLens Stem-on Def DBs |
2 |
in title and abstract (only apps.) |
|
|
20060602 |
PLens Stem-on Def DBs |
128 |
7 docs have only the misspelled form! |
|
|
20060607 |
PLens Stem-on Def DBs |
Arabidopsis AND expressed AND (SEQ AND ID) AND sequence |
9,544 |
All 9 PG patents are top of the list (sorted via relevance). First 100 patents all seem to have Arabidopsis in title (good) |
|
20060607 |
PLens Stem-on Def DBs |
Arabidopsis AND expressed AND ((SEQ AND ID) in claims) AND sequence |
3,984 |
As above, but "better" by reduction in hits(?) |
|
20060607 |
PLens Stem-on Def DBs |
Arabidopsis AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST |
2,287 |
All PG Inc patents at top of list, less hits, but now many non-At patents in list (incl human) |
|
20060607 |
PLens Stem-on Def DBs |
Arabidopsis AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims) |
1,476 |
As above, now with Mendel's app showing, still has long list of human apps following though. |
|
20060607 |
PLens Stem-on Def DBs |
((Arabidopsis in abstract) OR (Arabidopsis in title)) AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims) |
52 |
Bad - Mendel's app removed, no At in title or abstract! (Plant not At!) |
|
20060607 |
PLens Stem-on Def DBs |
(((Arabidopsis OR plant) in abstract) OR ((Arabidopsis OR plant) in title)) AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims) |
801 |
Good, PG and MBs present and some from Monsanto (not At though!) |
|
20060607 |
PLens Stem-on Def DBs |
(((Arabidopsis OR plant) in abstract) OR ((Arabidopsis OR plant) in title)) AND (expressed AND Arabidopsis) AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims) |
699 |
Good, PG and MBs present, Monsanto gone from top 100. |
|
20060607 |
PLens Stem-on Def DBs (filing date limit 1998 onwards) |
(((Arabidopsis OR plant) in abstract) OR ((Arabidopsis OR plant) in title)) AND (expressed AND Arabidopsis) AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims) |
657 |
Little improvement |
|
20060608 |
PLens Stem-on Def DBs |
(((plant OR dicot OR arabidopsis) in abstract) OR ((plant OR dicot OR arabidopsis) in title)) |
75,964 |
Huge! |
|
20060608 |
PLens Stem-on Def DBs |
(((plant OR dicot OR arabidopsis) in abstract) OR ((plant OR dicot OR arabidopsis) in title)) AND NOT ((monocot in abstract) OR (monocot in title)) |
75,761 |
Huge, little improvement! |
|
PLens Stem-on Def DBs |
||||
|
PLens Stem-on Def DBs |
Expressed Sequence Tags (ESTs)[add a comment]
The availability of the Arabidopsis genome and gene sequences represent an opportunity to investigate the influence of such data on patenting, patenting opportunities, and patenting strategies of stakeholders. Such an investigation is useful, not only to those patenting related "inventions", but also to those who use the data, the model system, or who intend to use similar systems. Whilst other genomes, including those of bacteria (e.g. E. coli), invertebrates (e.g. C. elegans and Drosophila), and the human genome (Jensen and Murray, ref###) are also useful in this regard, the availability of an analysis for a popular plant genome offers many advantages for those in the plant sciences and/or agribusiness. Understanding what has happened in Arabidopsis with respect to patenting allows us to better understand:
Although most of the members of the AGI were not for profits or government institutions, funded through public money, companies such as Monsanto (throught Cereon Genomics Inc) and (###) Perkin Elmer Applied Biosystems (### CHECK THIS) did support their efforts. Monsanto also sequenced the related Landsberg genome, and has supplied this data to the not-for-profit research community. Hence the sequencing and sequence data that exists today for Arabidopsis was affectively acheived through collaboration between private and public organisations. The end result of this collaboration, the genomic sequence, the ESTs, and the genetic markers are available to scientists throughout the world (either directly via public databases such as GenBank and dbEST, or under agreement with Monsanto for the Landsberg data).
However, a number of problematic issues exist with respect to Arabidopsis sequence data, particularly the data that was (and still might be) the focus of bulk sequence claims made by Paradigm Genetics Inc, and to a lesser extent those made by Mendel Biotechnology Inc. and Ceres Inc. Such companies have sought to protect their investment in Arabidopsis sequencing and research through patenting of large groups of genes. The initial step in such a strategy has been to apply for patent protection over large groups of genes. This "maximalist" strategy is not uncommon in patenting strategies focused towards other similar technologies, and arguably makes good business sense. Unfortuneately, once applications for large groups of sequences are made, this generates uncertainty for others in the field.
Almost paradoxically, patent applications may have a similar or greater influence on downstream deliverability of products and services than patents themselves. Compared to granted patents, applications may have the following properties:
One drawback of the ability of applicants to make applications for many hundreds or thousands of genes and/or nucleic acid sequences is the uncertainty this generates in the minds of potential users of the technology.
Uncertainty can take many forms, and this can result in researcher/strategists asking questions such as:
This uncertainty may form a barrier to use of the technology by others. Additionally, as we have seen in previous chapters, the strategies used by some applicants seem to rely heavily on establishing an early priority date for bulk sequence applications and then supplying "inventiveness" and "enablement" at some much later date. It is also possible for owners of the IPR to license technologies that are not-as-yet patented.
These properties of patent applications and the uncertainty generated by them produces a de facto protection for the technology under application. Hence, although most of the claimed sequences present in bulk sequence applications will never produce a granted patent, they are in effect "protected" by the applicant and some aspects of the monopoly possible under a granted patent (de jure protection) is acquired by them.
As we have seen in the case of bulk sequence applications, few sequences from applications have made their way into granted patents. Possibly because applications themselves offer a form of protection that allows income to be derived from licensing without the need for a granted patent. At this point granted patents appear to hold far less Arabidopsis sequences than patent applications. This is almost certainly due to the economies involved in making and maintaining applications rather than generating, maintaining and defending granted patents.
|
Imagine the cost of producing granted patents for 1,800 transcription factors
at ~US$20,000 each! |
|---|
It is unlikley that the originators of the US patent office envisaged that continuation and divisional applications would be used in this way, since the original system was established essentially to allow monopoly protection of mechanical inventions and not genomic information. Arguably, granted patents offer a far higher degree of certainty for both applicant, licencees, and business than does a patent application. However, the creation and maintenance of uncertainty (either wittingly or unwittingly) provides in some cases a more economical business tool to monetise a technology.
Charting an IP landscape surrounding any genome is a large undertaking. To ensure that this landscape is useful to as many as possible and is functional as soon as possible, we have chosen to analyse briefly only a small part of the IP surrounding the Arabidopsis genome. The choice to focus our analysis on only patent documents involving bulk sequence claims was made to ensure we could meet these initial goals. As with any landscape, constant revision and additions are required to keep the cartography accurate and relevant. With this in mind, we do not intend that this present version of the landscape is finished. It is instead a first step towards a more detailed landscape. The intention has always been that this document should by dynamic and flexible, and that user comments would help shape future landscape efforts. [add a comment]
Towards this end, the landscape is marked throughout with user commentable regions. We would be very interested to hear your comments about any of the regions of the landscape which you feel could be improved. We are particularly interested to hear from people who have first hand experience in these matters (Arabidopsis research, patenting, licensing, and legal aspects). User feedback will help us in directing future efforts to expand and update the landscape (and, if necessary, make corrections to the existing landscape). [add a comment]
Providing your input and help at an early stage of this project may very well create a landscape that provides yourself and users like you with a useful and valuable resource.[add a comment]
CAMBIA
29th June 2006[add a comment]
The information contained in this page was believed to be correct at the time it was collated. New patents and patent applications, altered status of patents, and case law may have resulted in changes in the landscape. CAMBIA makes no warranty that it is correct or up to date at this time and accepts no liability for any use that might be made of it. Corrections or updates to the information are welcome, please send an email to info@bios.net.