Arabidopsis Genome

flower2

This analysis was prepared by Dr. Brett Baillie and Dr. Marie Connett Porceddu using patent application and patent data provided by Neil Bacon and Doug Ashton from within a larger project funded by the OS4 Initiative (link) .  We are grateful for production assistance on many of the pages here by Dr. Kerry Fluhr, Dr. Wei Yang , Dr. Nick dos Remedios, and Nik Hatta.

Patent Landscape of the Arabidopsis genome and related plant species

This landscape deals with the Intellectual property and IP issues surrounding the genome of Arabidopsis thaliana. The landscape is intended to provide insights into the deliverability of innovations in the plant sciences for the public good. To this end the landscape investigates patenting activity directed towards the nucleotide and gene sequences from Arabidopsis - with a particular focus on patent applications involving large groups of DNA sequences. The results of the analysis will be of interest to those involved in public sector research and innovation who are unaware of the ways in which the fruits of public-funded research can be sequestered by private for-profit organisations.

Patent applications that claim Arabidopsis genes have the potential to also claim related genes from other organisms. Hence granted patents on genes in Arabidopsis may effectively grant exclusionary rights to similar genes from important industrial and subsistence crops such as canola, soya, potato, cassava, cotton, poplar and Eucalyptus.

Understanding the landscape of patents on Arabidopsis sequences also allows a better understanding of the degree to which these overlapping patent rights may influence future research and innovation. If findings about the function of Arabidopsis genes provide stronger enablement for composition of matter claims already filed, research done in the public sector may not actually be in the public domain or available for public good uses!

What this Landscape is about

This landscape will provide an overview of some important aspects of the intellectual property claims covering the Arabidopsis genome, which many in the biological sciences consider to be a public good, a model system that enables critical insights into many of the important pathways of crops, weeds and other plants.  In particular, we investigated "bulk sequence claims", patent applications that attempt to claim  100 or more Arabidopsis sequences, and found a large number of such patent applications by Monsanto, Syngenta, Dow, Dupont, Bayer, and BASF and their subsidiaries and researchers who work with these companies under exclusive contracts.

What this Landscape is NOT about

This landscape is not intended to provide a legal opinion about freedom to operate using Arabidopsis sequences.   The nature of patenting systems worldwide means that new patents and patent applications may appear at any time.  Similarly, patents may lapse or patent applications may expire or be replaced by new applications.  During the course of preparation of this landscape in early 2007, a series of new publications appeared, providing information for the first time on applications encumbering thousands of sequences, filed up to seven years earlier.  This provided a firm reminder that it is not possible to take for granted that if a topic is not readily found in a search of published patents and patent applications, there is freedom to operate!  So, although we have tried to give the best coverage of the intellectual property surrounding Arabidopsis sequences, this landscape should not be viewed as a comprehensive coverage of the subject.

What You Need to Know About Patents

Below are some important patent law concepts that are important for understanding the Rice Genome Landscape:

  1. The claims define the scope of protection of a patent.

  2. A patent application is not the same as a patent.

  3. There is no such thing as an international patent.

  4. The entity that has the patent rights may not be the owner listed on the patent document.

  5. Individual patents have a limited lifetime.

What is Arabidopsis thaliana?

Arabidopsis thaliana is a model plant used in research to study many aspects of plant biology.  Originating in Europe and Central Asia, it is a dicot, as are many important and culturally significant staple crops, such as potato;  commercially important food crops, such as soybean; and fiber crops, such as cotton and hardwood trees.  Unlike many other members of the family Brassicaceae (the same family contains canola, broccoli, cabbage, mustard and other food and oilseed crop species), Arabidopsis thaliana is not grown for food or oil production, and is not agronomically important in itself.  Plants are quite small (see Figure 1 below).

Although in the genus Arabidopsis there are approximately 9 species and 8 sub-species (see the entry for Arabidopsis in Wikipedia), it is not uncommon for plant biology researchers to use "Arabidopsis" to mean Arabidopsis thaliana, the convention used in this landscape.

Arabidopsis1

Figure 1: Arabidopsis thaliana seedlings growing in tissue culture (seedling ~10mm in width)

The popularity of  as a model for all plant processes has given rise to numerous methods and protocols for its use in research. Because germination through to senescence is only approximately 50 days (The Arabidopsis Information Resource (TAIR)), Arabidopsis offers a fast system in which to study processes that may take months or years in other flowering plants. The availability of large numbers of mutants and T-DNA insertion lines (e.g. The European Arabidopsis Stock Centre, NASC) has made it useful in understanding the role of many plant genes, particularly those involved in development, metabolism, and disease resistance.

Arabidopsis was chosen as the first plant species for a public whole genome sequencing effort because of its small genome size. The table below shows the genome sizes of some commonly-studied organisms, including Arabidopsis.

Organism

Common Name

Genome Size

Escherichia coli K12 E.coli

        4.64 Mb

Saccharomyces cerevisiae YJM789

baker's yeast

      16 Mb

Arabidopsis thaliana

thale cress

    120 Mb

Oryza sativa

rice

    430 Mb

Lycopersicon esculentum

tomato

    950 Mb

Zea mays

maize

  2365 Mb

Homo sapiens

man

  3038 Mb

Triticum aestivum

wheat

17000 Mb

(Genome sizes were taken from the NCBI's Genomic Biology web pages.)

The future of Arabidopsis thaliana research?

The availability of sequence data for the Arabidopsis genome and EST projects has ensured that Arabidopsis is now more widely used than ever as a model for the study of flowering plants, and new or improved enabling technologies are being applied to it as they become available for research.  For example, with the availability of genomic sequence and EST data, gene silencing or RNAi technology is being used as a research method to determine the function of Arabidopsis genes.  The Arabidopsis Genomic RNAi Knock-out Line Analysis (AGRICOLA) project was started in 2002 with the goal of providing the scientific community with research tools (specific vectors and cloned Arabidopsis DNA) necessary to study gene function via gene silencing. 

Understanding the patent landscape surrounding Arabidopsis is important in planning research that aims for application in products, because of the continued importance of Arabidopsis as a research tool to many thousands of scientists. Furthermore, use of the genomic information of Arabidopsis in patents and patent applications has resulted in broad claims for patent coverage of genes in other flowering plants.

Khush GS (1997) Origin, dispersal, cultivation and variation of rice. Plant molecular biology 35 (1-2), 25-34 (Sep 1997)

The Genomic Structure of Arabidopsis thaliana

See background information about the structure of chromosomes. The genome of Arabidopsis thaliana consists of 5 chromosomes, one circular mitochondrial DNA, and a circular chloroplast DNA. Chromosomes are often drawn as in Figure 1 below: with a short and long arm and an intervening centromeric region. Figure 2 depicts the shapes and relative sizes of the A. thaliana chromosomes. Table 1 shows the size (in Mbp) and predicted number of genes for each chromosome. 

Chr Figure

At Chrs figure

Figure 1

Figure 2

Table 1

Chromosome

Size
(Mb)

Predicted Genes

Chr1

30.4

7863

Chr2

19.7

5115

Chr3

23.5

6255

Chr4

18.6

4811

Chr5

27.0

7084

Mitochondria

0.366

135

Chloroplast

0.154

122

Total

~120Mb

31,385

Arabidopsis thaliana Genomic Sequencing

The Arabidopsis Genome Initiative (AGI) was set up in 1996. Consisting of 6 groups, it's mission was to sequence the genome of Arabidopsis thaliana by 2004 at high accuracy. Various aspects of the sequencing mission were divided between the 6 groups:

AGI

Links:

Stanford University
The Institute for Genome Research (TIGR)
Kazusa
Genoscope
Cold Spring Harbor Laboratory
European Scientists Sequencing Arabidopsis (ESSA)

The sequencing project was completed in 2000, four years ahead of schedule.

The sequence derived from the AGI's efforts were the subject of a number of publications in nature, including:

Before, during, and subsequent to the AGI's efforts, groups, including some of those involved in the genomic sequencing, were involved in sequencing Expressed Sequence Tags (ESTs) from Arabidopsis. The following companies and institutes were involved in these efforts:

Today there are more than 622,000 Arabidopsis ESTs in the dbEST database at the National Center for Biotechnology Information (NCBI).

Patenting of genes and sequences from Arabidopsis

Analysis of the recent history of patent documents involving Arabidopsis provides some interesting information regarding the evolution of present day patenting strategies for organisms whose genomes are available. It also offers the opportunity to investigate Arabidopsis as a "model" for future patenting of other plant genomes.

The following simple search was performed using PatentLens to search the term "Arabidopsis" in full text patent documents (detailed search parameters here). Although it must be emphasized that this search is likely to overestimate the number of patent documents making claims to Arabidopsis "inventions" (it will also uncover documents making reference to Arabidopsis), its main purpose should be seen as a rough measure of the impact of Arabidopsis on patent publications.

Untitled1

Figure 1: Patent documents containing the term "Arabidopsis" in the full text

  • Although patent documents back to 1976 contain the term "Arabidopsis", there was a rapid increase in the number of documents referring to Arabidopsis between 1990-95.
  • Analysis based on filing date demonstrates a clear decline after 2002 in such documents.
  • A similar trend appears in 2005 for the analysis based on publication date (although it may be too early to tell for certain, it is not unreasonable to expect that the publication date trend will follow, and lag, that of the filing date trend).
  • This pattern reflects what may be expected for a mature technology, where the initial rush to patent a new technology has ended.

Arabidopsis Graph2

Figure 2: Interest in Arabidopsis as gauged from comparisons of patent applications, and journal and sequence publications

Although the number of patent filings of documents containing the term "Arabidopsis" fell in 2004 and 2005, the number of deposited Arabidopsis sequences in GenBank remains high (although there is a large variation in the number of sequences deposited from year to year).  Additionally, "Arabidopsis" appears as a search term in an ever increasing number of PubMed entries at NCBI.

One interpretation of the sequence and publication trends is that interest in Arabidopsis as a model organism remains high, and possibly increases with time.  However, this later interest (after 2003) has not translated into an increase in patent applications.  Such observations are consistent with what may be expected for a mature technology.

(Again, please note that PubMed entries contain the word "Arabidopsis" in the text, they need not necessarily be publications directed towards Arabidopsis. This is similar to the case of full text searches using PatentLens. Hence we argue that the trends in this data are comparable, providing that the user understands the limitations of the datasets and searches.)

One of the caveats with the analysis above is that it does not focus on patents making claim to sequences from Arabidopsis.  Such an analysis is difficult due to the fact that more than one sequence can be claimed in a single patent or patent application.  Hence a plot of the number of Arabidopsis sequences in claims against the year of application may look very different from a plot of patent documents containing claimed Arabidopsis sequences versus year of application.

Hybridization Language in Patent Claims

There are many ways to define the limits of a claim towards related genes, nucleotides, or protein sequences. One way is through a direct comparison of two aligned sequences in order to determine if they are "similar" enough to each other so that the claim language is satisfied. Typically this requires a quantifiable comparison, often in the form of "% identity":

US 2005/50583 (assigned to Genesis/Agrigenesis, according to press releases now subject to the rights of ArborGen) Claim 3

An isolated polynucleotide comprising a sequence selected from the group consisting of:

(a) sequences having at least 75% identity to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;

In this case if a compared sequence (even a sequence not in the possession of the patentee, such as one discovered in another species) falls within the 75% sequence identity range, it could potentially be subject to the patent applicant's claims.  Alternatively, subject to a number of detailed caveats (for the US see 35 USC 102), if a compared sequence more than 75% identical was made public or submitted for patent by someone else prior to the priority date of the patent application ("prior art"), the patent claim may be invalid. 

Such a comparison is technically straightforward, although for a long sequence listing with more than a handful of sequences (almost 500 are claimed here) it would require a large amount of time for the examiner to do a thorough analysis to determine whether or not the claim is valid, and for a researcher or a member of the public to determine whether any particular sequence might be infringing.

Even more difficult for the patent examiner and the public to compare, to clearly define the bounds of what is subject to patent claims, is the use of such claim language as:

US 2005/91708 (assigned to Dow Chemical) Claim 1

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-7554 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency, wherein expression of said isolated nucleic acid in a plant results in an altered metabolic characteristic.

Hybridisation experiments would seem to be a definitive way to define similarity in patent documents. Unfortunately, the tools to perform fast and accurate comparisons of sequences based on such language, without the use of a research laboratory, are lacking in the patent office, and what is more, a skilled practitioner will realise that many (possibly only distantly related!)  nucleic acid sequences will hybridise together under low stringency conditions.   At some frequency, any DNA at all will hybridise to the claimed sequence under any stringency conditions. 

Hence hybridisation language, when used in patent documents, has the potential to create uncertainty in understanding the scope of the claim.  The probability that an applicant will achieve such claims to all 7554 sequences is small.  However, it is possible not only that some of these sequences will be successfully claimed in issued patents, but that most remain pending in future applications. The uncertainty with this claim is two-fold:

Below we explain, for those unfamiliar with the concept, how hybridisation works and how it is often presented in a patent application or patent.

The Basis for Hybridisation Language

DNA is a biological polymer consisting of "simple" repeated subunits, or bases.  It is within the sequence of these bases that genetic information is stored. Although DNA can exist as a single strand in solution, it is generally found within living systems together with its complementary strand (see the figure below).  If we consider a simple representational diagram of a DNA molecule, consisting of a sequence of bases: A, G, C, and T. We can see that some of the bases interact with one another in the figure below.

image3

The bases A,T,G, and C form hydrogen bonds with their respective bases T, A, C and G, in the complementary strand.  These bonds provide a stability to the double strand DNA (dsDNA), under normal circumstances, preventing it from dissociating into single strand DNA (ssDNA).  Significantly, dsDNA can be "melted" to produce ssDNA under certain physical circumstances and can be influenced by many factors such as temperature, salt concentration, inorganic solvents, sequence composition, and (importantly) the degree of sequence similarity or missmatch between ssDNA molecules.

The Concept of Melting Temperature (Tm)

Mathematical relationships between the various physical parameters listed above and the melting point of DNA have been established.  Historically, the point at which 50% of the DNA molecule is ssDNA is referred to as the melting temperature or Tm.

The Tm for a DNA sequence can be estimated via the commonly-used calculation:

TmCalc

Where     Tm         = melting temperature in oC
               [Na+]     = Molar concentration of sodium ions in
            %[G+C]    = percent of G+C bases in DNA sequence
                 n         = length of DNA sequence in bases
                 P         = temperature correction for % mismatched base pairs (~1oC per 1% mismatch)
                 F         = correction for formamide concentration (= 0.63oC per 1% [formamide])

Note:  A particular DNA sequence will have different Tm values under different conditions.  This calculation is sometimes referred to as the effective Tm.

The fact that ssDNA can re-anneal to form dsDNA is used in the process called hybridisation. 

NOTE that the Tm refers to 50% hybridisation, not complete hybridisation!

A typical hybridisation experiment consists of a number of steps:

  1. Binding of target ssDNA to a solid support (often a thin nitrocellulose, nylon, or polymer membrane), referred to here as a "blot".
  2. Blocking any remaining free binding sites for DNA on the blot.
  3. Labeling of the "query" ssDNA or probe.
  4. Creation of a hybridisation solution containing the probe at a known concentration of salt, formamide,...
  5. Equilibrating the "blot" with the hybridisation solution at a defined temperature (for a specified length of time).
  6. Washing the unhybridised probe off the blot using a specific wash protocol;  the wash conditions are actually more important than the hybridisation conditions for determining what will remain hybridised, but are less often specified in patent applications!
  7. Detecting the bound probe by some means.

If the probe (or query DNA) is similar enough to the target ssDNA (on the blot) then the probe will remain bound to it after the wash step.

"Stringency" refers to the concentrations and temperatures in steps 4-6 above, and it is most often "strigency" that is referred to in order to determine the scope of claims.  Strictly speaking, this should refer to the "stringency" of the wash step required to leave the probe bound to target sequences covered by the claims:  the "query" is often a sequence mentioned in the claims section (as in the example given above).

Although the table below is not legally binding and indeed patent applications may make their own definitions of stringency levels, the typical usage is roughly:

Stringency

[NaCl] (Molar)

Range from Tm (oC)

Description

High

0.0165 - 0.0330

5 to 10oC below Tm

Only highly similar sequences will bind under these conditions (typically >95%)

Medium/Moderate

0.165 - 0.330

20 to 29oC below Tm

Those sequences above and less homologous ones (typically >80%)

Low

0.330 - 0.825

40 to 48oC below Tm

Sequences with lower homology will bind under these conditions (>50%?)

1. Note that the values here are for the purpose of discussion.  Actual values and conditions must be acquired from the patent document specifications.
2. The "Range from Tm" is the temperature of the wash solution required to achieve the level of stringency and is dependent on the factors described above.
3. Values in Column 2 are typical only!
4. Values in column 3 were obtained from US 2006/0057724 A1
5. Values in Column 4 are typical only and sourced from the web.  Refer to patent specifications for your details!
6. Definition of stringency levels may vary between patents (BEWARE). Stringency conditions are usually defined in the patent document!

Sometimes the patent examiner will require the applicant to stipulate the stringency conditions: high/medium/or low stringency, or specifying the [NaCl] and temperature of the wash conditions.  Few bother to give a reference with respect to Tm (a value that can be calculated).  It is possible that the conditions defined as stringent in one patent document may not be as stringent as those defined in another!

Summary

Arabidopsis is an extremely important model system used to study many aspects of plant biology. From the graphs and figures presented in this chapter, it is obvious that there are a great many references to Arabidopsis in the scientific literature and in patent documents. These observations are consistent with the importance of Arabidopsis as a model organism for the study of plant biology. The interesting aspect of the appearance of "Arabidopsis" in the patent literature is the fact that Arabidopsis is NOT a commercial crop. Compared to important crops such as canola, soybean, and potato there would appear to be little value in patents directed towards Arabidopsis.

While the term "Arabidopsis" appears frequently in patent documents, it is the contents of the "claims" section of these documents that is most important. Since the focus of this landscape is towards the Arabidopsis genome, it is of interest to understand to what extent these claims are directed towards sequences and genes from Arabidopsis. Also of interest is an applicant's reasons for claiming sequences from an organism with little commercial value. In the following chapters we will analyse example patent applications and granted patents more closely to answer some of these questions.

Since patents containing claims to Arabidopsis sequences are the focus of this landscape, much of the analysis in the following chapters will involve patent documents making claims to large groups of genes and sequences from Arabidopsis. These patent documents will be described here as "bulk sequence" applications. For the sake of convenience and clarity, we will define "Bulk Sequence" applications as those that claim protection for 100-or-more Arabidopsis sequences. It is these patent documents that form the core of the following patent analysis.

Bulk Sequence Applications

Within the last decade there has been a trend towards attempts at patenting large groups of nucleic acid sequences from sequencing of EST libraries and genomes. There are “bulk sequence” claims over large portions of many genomes, including that of Arabidopsis.  Since Arabidopsis was the first plant to have its genome sequenced, is a popular model organism, and there are extensive expressed sequence data (ESTs and cDNA sequences) available, examination of the patent landscape of Arabidopsis offers the opportunity to examine what may happen to other plant genomes. 

Bulk sequence applications are important because they potentially cover genes involved in a very wide variety of processes in development, metabolism, and disease resistance. Hence it is a concern to examine the success of such patent applications in granting exclusionary rights to early applicants who can control what can be done with later research in these far-reaching areas.

The effects are not limited to those working on Arabidopsis.   Arabidopsis is not a crop plant.  If the economic gain to be had from patents on Arabidopsis genes was limited to royalties from Arabidopsis growers, there would be few patent applications pending on the genome of this plant.  Most who work on it are using it as a model for important related species such as soybean and cotton.  The claims in many of the bulk sequence applications described below would allow the rights-holders of patents on them to exclude all others from the use of genes not only in and from soybean and cotton, but many other leguminous food crops, fiber crops including trees such as Populus, fruit crops and ornamentals, and subsistence crops such as cassava and cowpea that are staples in the diets of millions of people. 

Thus Arabidopsis offers a further dimension as a plant model: as an example of plant genome patenting.

Patenting Organisations

Historically there have been a number of companies interested in the genomic sequences and expressed sequence tags (ESTs) from Arabidopsis.  Many companies, including Paradigm Genetics Inc. (now Icoria Inc.), Ceres Inc., Mendel Biotechnology Inc., Syngenta, Dow Chemical, and BASF have applied for patents over Arabidopsis sequences.

The ownership of rights to Arabidopsis sequences is complicated by the pattern of mergers, acquisitions, and exclusive research licenses that give control of the intellectual property to entities different from those named as the patent applicants.  In some cases when the ownership of a patent or patent applications changes hands, an assignment of rights is filed with the patent office, but in the U.S. there is no requirement that rights controlled through exclusive licenses be publicly recorded.

Monsanto has equity stakes in or controls the rights over the intellectual property resulting from exclusive research contracts with Mendel, Ceres and Paradigm/Icoria, as noted in press releases.  Monsanto has actually acquired a portion of Paradigm/Icoria (Agrinomics) but in this acquisition and in the exclusive research agreements, information about the extent of the control over the intellectual property has not been made public.  Monsanto has also had extensive historical ties to the Arabidopsis genome sequencing project, and has extensive bulk sequence claims over genes from other species such as cotton and soybean.  Monsanto's efforts to patent ESTs from maize were rebuffed in a court case suggesting a higher standard for utility should be applied for sequence patents (re Dane Fisher case; See also this commentary link), but hundreds of thousands of sequences remain in patent applications over Arabidopsis as well as soybean, maize, rice, and other crops.

Syngenta also controls intellectual property claims through granted and pending patents nominally assigned to a number of companies that have been partially or completely acquired by Syngenta or its parent companies, and through research alliances.   Nominal assignees of Arabidopsis intellectual property probably controlled by Syngenta include Torrey Mesa, Zeneca, Ciba Geigy, Novartis, Advanta, Garst, Agripro and Danisco. 

See bar graphs showing the portions of the Arabidopsis genome that are covered by patent applications and granted patents that are assigned to the following companies and others with whom they have equity stakes and exclusive research alliances:  DuPont, Monsanto, Paradigm Genetics, and Syngenta.

Legal Aspects of Bulk Sequence Claims

Much of the focus of patenting attempts and granted patents for Arabidopsis genes (and those of many other organisms) have been towards patent protection in the US. Other countries, such as Australia, often use the US as a model for guiding their own patent office procedures. Hence, when considering the legal aspects of bulk sequence claims, we will be referring to the current state of affairs in the US and in particular the actions of the United States Patent and Trademark Office (USPTO) towards bulk sequence applications. To understand the basis for patent protection in the US, one must understand that:

A patent applicant is entitled to a patent on an invention if it is novel, inventive, has utility, and if the patent disclosure enables its use.

The default, with respect to applications, is that the applicant is entitled to the patent, and the applicant is given the opportunity throughout the examination process to edit the claims of the application to maximise the chance that a patent is issued. Hence the onus is on the patent office to prove that a patent application fails to meet the criteria for patentability. Patent examiners have a limited amount of time to spend examining an application.

It can be argued that such bulk sequence applications represent attempts to patent "discoveries" (which are not patentable) rather than "inventions". Specifically, the applicant has added to the amount of human knowledge by disclosing something that occurs in nature, rather than adding to human knowledge through the creation of something new (a useful invention). Therefore, it could be argued that bulk sequence applications lack an inventive step.

For an application that claims many sequences, applicants have historically argued that novelty, inventiveness, and utility are met by such large sequence applications. They have done so by pointing out that such sequences can be used (inventiveness and utility) for particular purposes, including:

These proposed uses bring to light another problem with such groups of sequences: Enablement.

The disclosure of information made in "payment" for the grant of a patent must enable a person "skilled in the art" to use the invention. A bulk sequence application may fail to adequately address this issue, since it would be extremely difficult for any applicant to give specific instructions on how to use all the sequences for the purposes listed above.

The USPTO has responded to attempts to patent large groups of sequences by strictly adhering to the requirement for enablement and substantial utility for such applications.

Recently, the USPTO's commitment to these standards were challenged by Monsanto, who had previously applied for patent protection for 32,236 maize ESTs (US patent application 09/619,643), and were refused a patent. Monsanto appealed the USPTO's decision in the Court of Appeals for the Federal Circuit. The appeal was denied by a majority of 2 out of 3 judges, based on a lack of utility and enablement (re Dane Fisher case; See also this commentary link).

Since the re Fisher case was not a unanimous decision, a question still remains as to whether-or-not future large sequence claims will be successful.

Another important point to note about granted patents:

Under US law a granted patent is assumed to be valid.

This fact, in addition to the high cost of contesting a granted patent (generally around one million US dollars), provides an effective barrier to those interested in challenging a patent's validity.

Summary

Bulk sequence applications have (and are still) used to make claims to large numbers of sequences from an organism (including many animals, bacteria, and plants). There is no upper limit to the number of sequences that can be claimed within a single patent application within the US system. Some plant applications may contain more than 100,000 sequences. Hence the success of even a single bulk sequence application would mean that a large portion of a plant genome would be protected by patent claims. Another interesting point is that it is common for companies to license technologies that have a "patent-pending" status (i.e. that have not yet received a granted patent). Hence the protection that might be derived from a granted patent is incentive enough to provide protection in itself. Therefore patent applications (including bulk sequence applications) are also important when considering issues of deliverability, innovation and freedom-to-operate.

Whilst Monsanto's efforts to patent large numbers of maize ESTs were recently unsuccessful, examples of granted bulk sequence patents exist, and applicants continue to pursue bulk sequence applications at the USPTO.

Are there bulk sequence applications for Arabidopsis ?

In the following chapters we will investigate bulk sequence applications for Arabidopsis genes.

Mendel Biotechnology Inc.

Mendel is a US company located in Hayward, California. Deriving its name from the "father of genetics", Gregor Mendel, the company was founded in early 1997 with the idea that controlling gene expression would lead to new ways of manipulating plant growth and development.

Mendel's main interest in Arabidopsis is the 1,800-or-so transcription factors that control the expression of ~27,000 Arabidopsis genes. Accordingly, Mendel has applied for patents over a large number of Arabidopsis transcription factors. Presumably this strategy relies on the fact that control of expression of genes via transcription factors is one way in which useful new phenotypes may be engineered into flowering plants (including many agronomically important crops). Since transcription factors are expected to play important roles in plant development, it is expected that Arabidopsis transcription factors may be functionally and structurally similar to, and highly homologous with transcription factors of other flowering plants.  Thus a claim on Arabidopsis transcription factors may also effectively be a claim over the same gene in many other flowering plants.

Transcription Factors

RNA polymerases (RNAPs) are enzymes involved in the process of transcription (transcribing an RNA molecule using DNA as a template).  While prokaryotic RNAP can recognise gene promoters and initiate transcription from the proper start point, the efficiency of the process is further controlled by trans-acting proteins called Transcription Factors (TFs). In both eukaryotes and prokaryotes, the process of transcription is tightly regulated for many genes and one way in which the cell is able to control transcription is through the use of TFs. In eukaryotes RNAP II requires the presence of a group of auxiliary proteins (general Transcription factors) to be able to recognise gene promoters. Hence production of mRNA in eukaryotes requires the presence of general TFs, and may also be further controlled by the presence of gene-specific TFs.

TFs are proteins that bind to specific DNA sequences within the promoter or enhancer regions of a gene and in so doing are able to control the transcription of the gene. They may either decrease (repressors) or increase (activators) the level of transcription. They are often involved in the final step in a signal transduction pathway and can be activated or deactivated by other proteins higher in the pathway. TFs can be recognised from EST and genomic DNA sequencing projects based on the fact that they often contain sequence motifs such as: zinc fingers, helix-turn-helixs, and basic-helix-loop-helix (for DNA binding) and leucine zippers (for interaction between TFs).

Patent Documents Involving Transcription Factors

We have chosen one patent application by Mendel Biotechnology as an example of its patenting strategy towards transcription factors.  The patent application US 2004/0019927 A1 attempts to claim a large group of Arabidopsis transcription factors and related sequences in plants.  This is the largest of the Mendel patent applications for bulk sequences that we were able to find (although there may be more), others also exist that claim either individual or small groups of TFs. Analysis of granted TF patents are discussed in Chapter 7.

Mendel Biotechnology Announces Agreement With Monsanto

HAYWARD, Calif., July 10 /PRNewswire/ -- Mendel Biotechnology, Inc. ("Mendel") today announced a new research and commercialization partnership with
Monsanto Company ("Monsanto"). The new program represents a significant expansion of the ongoing collaboration between the companies and extends
the agreement through 2011. Mendel and Monsanto have agreed not to disclose financial details of the agreement.

Monsanto and Mendel will continue to collaborate on the commercialization by Monsanto of technologies developed by Mendel. In addition, the
companies will initiate a systems biology program to develop an integrated framework for predictive control of plant gene expression that is anticipated
to streamline discovery and product development activities...

This new partnership extends the relationship initiated in 1997, under which Mendel executed a functional genomics program in the model plant
Arabidopsis focused on understanding the factors that control plant gene expression. That program identified a number of genes that have been shown to
enhance the yield of major crops in extensive field trials and that are in early commercial development by Monsanto.

Press Release    Source: Mendel Biotechnology, Inc.    July 10, 2006

"Four years ago, Chris Somerville, head of plant biology at the Carnegie Institution of Washington, slipped a gene for making plastic into Arabidopsis, a type of mustard plant. The gene turned the plant into a biological plastics factory.  Now, Monsanto Co. scientists are turning the concept into commercial reality....Just as exciting is a recent discovery by Calgene [now owned by Monsanto] scientists of the gene for the enzyme controlling the formation of cellulose in plants. After 30 years of fruitless biochemical search for the enzyme, ''this is our first break in understanding how to control biomass,'' Somerville explains. Genetically boosting the enzyme could make it possible to create trees with much higher proportions of cellulose--the plant kingdom's structural fiber--and less than the normal amounts of other cell wall components. Because these secondary components are what make the pulp- and papermaking process polluting and inefficient, scientists say, the engineered trees could help clean up a major industry. Beyond that, ''there is a mad scramble inplant biology to find the most useful genetic sequences,'' says Somerville. ''The world hasn't even seen the tip of the iceberg.'’ ‘“THE BIOTECH CENTURY”, Business Week, 1997 Chris Somerville is still at the Carnegie Institution, but is also the CEO of Mendel.

Arabidopsis chromosome 3

This diagram identifies a locus in the Arabidopsis genome with many patent applications covering genes in a pathway important for biofuels.   Mendel's patenting (and that of other large companies such as DuPont) is not only over crops critical to worldwide food security, but also industrial uses that could add to the prosperity of developing countries if they were available to exploit.  Who gets to exploit them?

www.mendelbio.com/news/

Moderna, Monsanto link up in new venture

FOCUS - Moderna, Monsanto link up in new venture Reuters, Monday, November 24, 1997 at 20:09 By David Luhnow

MEXICO CITY, Nov 24 (Reuters) - Mexican conglomerate Empresas La Moderna (MEX:MDA.A) and U.S. agricultural giant Monsanto Co (NYSE:MTC) on Monday said they had bought a stake in newly formed agricultural genetic research company Mendel Biotechnology Inc. Each company paid $15 million to fund a five-year research and development project at Mendel, a start-up firm based in San Francisco and headed by some of the world's leading plant genome experts, according to La Moderna officials in Mexico. In return, Moderna and Monsanto each get a 20 percent equity stake in Mendel and rights to develop and commercialize Mendel's technical capabilities in the next generation of value-added agricultural products.

Both companies also have an option to buy an additional 10 percent stake in Mendel at an undisclosed future date, La Moderna company spokesman Dieter Holtz told Reuters. Each firm will have rights to Mendel's research in the areas they are interested in and dominate, officials said. Monsanto will focus on Mendel's advances in agronomics, or the capabilities in plant genetics and genomics for many crops, including corn, soybeans, fruits and vegetables. "This collaboration gives us the abilty to better understand the function of specific genes in plants, thereby allowing us to more quickly introduce crops with improved agricultural traits," said Ganesh Kishore, Monsanto's assistant chief scientist and chief biotechnologist. Kishore said the venture will greatly reduce research and development time for products that increase yield or otherwise enhance the growing, processing or nutritional characteristics of food.

La Moderna, the world's largest vegetable seed maker, will have access to Mendel's research in vegetable seeds as well as fresh and processed fruits and vegetables. "We will each focus on our core businesses: Monsanto in agronomics and (its herbicide) Roundup and Moderna in vegetable seeds and fruits and vegetables," Holtz said in a telephone interview.

Earlier this year, Moderna sold its Asgrow agronomics division to Monsanto for about $240 million in order to focus more on the Mexican company's strength in vegetable seeds. Mendel's president and CEO is Michael Fromm, a former research director at Monsanto, and its chairman of the board is Christopher Sommerville, director of the Department of Plant Biology at the Carnegie Institution of Washington. "These are the best of the best in the field," Holtz said. The top researchers at Mendel were responsible advances in areas such as disease resistance, the molecular genetics of nitrogen fixation in plants and the genetics of photosynthesis, he added.

"We didn't want to have majority control of the company because we didn't want to cramp their entrepreneurial spirit. We thought it was better to see what they came up with on their own," Holtz said.

Copyright 1997, Reuters News Service

Monsanto, Empresas in Joint Venture with Mendel

MONSANTO, EMPRESAS IN JOINT VENTURE WITH MENDEL Reuters, Monday, November 24, 1997 at 16:22 ST. LOUIS, Mo., Nov 24 (Reuters) - Monsanto Co (NYSE:MTC) and Empresas La Moderna S.A. (MEX:MDA.A)
said Monday they signed a broad technology collaboration agreement with Mendel Biotechnology Inc in the field of agricultural functional genomics.

The agreement gives Monsanto and Empresas exclusive access to Mendel's technical capabilities in plant genetics and genomics for many crops, including corn, soybeans, fruits and vegetables.

As part of the agreement, Monsanto and Empresas will each acquire a substantial minority equity interest in Mendel and fund a research and development program over five years. Financial terms of the agreement were not disclosed. "This collaboration gives us the abilty to better understand the function of specific genes in plants, thereby allowing us to more quickly introduce crops with improved agricultural traits," said Ganesh Kishore, Monsanto's assistant chief scientist and chief biotechnologist.

Kishore said the venture will greatly reduce research and development time for products that increase yield, or otherwise enhance the growing, processing or nutritional characteristics of food.

Copyright 1997

Monsanto and ELM Enter Into Joint Technology Venture With Mendel Biotechnology

PR Newswire, Monday, November 24, 1997 at 19:46 ST. LOUIS, Nov. 24 /PRNewswire/ -- Monsanto Company and Empresas La Moderna, S.A., (ELM) jointly have signed a broad technology collaboration agreement with Mendel Biotechnology Inc. in the field of agricultural functional genomics. The agreement gives Monsanto and ELM exclusive access to Mendel's technical capabilities in plant genetics and genomics for many crops, including corn and soybeans, and fruits and vegetables. Leaders in the plant genetics and genomics fields to identify the function of genes founded Mendel Biotechnology and patent the corresponding DNA sequences that will produce the intellectual property basis for the next generation of agricultural products created through biotechnology.

As part of the agreement, Monsanto and ELM will each acquire a substantial minority equity interest in Mendel and fund a research and development program over a five-year period. Financial terms of the agreement were not disclosed.

"This collaboration gives us the ability to better understand the function of specific genes in plants, thereby allowing us to more quickly introduce crops with improved agricultural traits," said Ganesh Kishore, assistant chief scientist and chief biotechnologist of Monsanto. Kishore said the venture will greatly reduce research and development time for products that increase yield, or otherwise enhance the growing, processing or nutritional characteristics of food.

"This is our first venture in genomics, and it builds nicely on our original technology agreement with Monsanto as a preferred provider of agronomic and quality traits we're already using in our fruit and vegetable seed and produce businesses," said Alfonso Romo Garza, chairman and chief executive officer of ELM. Through this partnership, ELM will have exclusive rights to Mendel's technology for the development of proprietary, transgenic fruits and vegetable products that create value for growers, processors and consumers.

Mendel's founding partners include several of the world's most renowned plant geneticists. Their scientific contributions include the isolation of genes responsible for disease resistance, the molecular genetics of nitrogen fixation in plants, the genetics of photosynthesis and plant oil biosynthesis, the development of synthetic plastic in plants, and the gene transfer technology for corn.

"Applying cutting-edge genomics capabilities to the agricultural biotechnology strengths these two leaders have developed is a natural next step in discovering and commercializing improved-trait crops," said Christopher R. Somerville, chairman of Mendel. Somerville also is the director of the Department of Plant Biology at the Carnegie Institution of Washington, and professor of the Department of Biological Sciences at Stanford University. The president and chief executive officer of Mendel Biotechnology is Michael Fromm, a former research director of enabling technologies and plant genomics for Monsanto.

As a life sciences company, Monsanto is committed to finding solutions to the growing global needs for food and health by sharing common forms of science and technology among agriculture, nutrition and health. ELM is a leading agribusiness biotechnology company focused on developing and marketing premium branded vegetable seeds, as well as fresh and processed fruits and vegetables. Mendel, a newly formed biotechnology company, specializes in applying functional genomics techniques to create valuable new traits in plants.

SOURCE: Monsanto Company 11/24/97

CONTACT: Lori J. Fisher, 314-694-8535
Lori.J.Fisher@monsanto.com

Search Strategy

Since Mendel Biotechnology Inc. is known to have interests in Arabidopsis, the company name was used as a search term to uncover new patent documents.

The Mendel Biotechnology Inc. patents discussed in this section were identified using the following search strategy.

Search details

Date of search

18 05 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(arabidopsis AND thaliana) AND (Mendel AND Biotechnology)

Results

161

Comments

Of the 161 results identified using these search terms, 11 results were identified as having bulk sequence claims based on the content of their titles, abstracts, and claims.  The correspondent or applicant named on all these applications is Mendel Biotechnology Inc., 21375 Cabot Blvd., Hayward, CA, 94545.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

As a precaution, a further search was performed:  this time no assignee/applicant/correspondent names were used.  Instead the strategy conducted was:

Search details

Date of search

14 06 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(mendel AND biotechnology) AND ((SEQ AND ID) in claims) 

Results

186

Comments

Of the 186 results identified using these search terms, more than 10 were found to claim large numbers of Arabidopsis sequences (mostly transcription factors). The remainder were also of interest since they appeared to also claim smaller groups of transcription factors.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

Patent applications for Arabidopsis bulk sequences made by Mendel Biotechnology Inc.

The following is an example of one of the 11 patent documents identified as belonging to Mendel Biotechnology Inc.  The '927 application is discussed here for the following reasons:

(Note that this patent application contains by far the most sequence claims compared to other applications.  Note also that there are ~1800 TFs in the Arabidopsis genome and there are >1900 SEQ IDs listed in this application alone! Hence this application is likely to be a significant patent application with respect to TFs for many plant species.)

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2004/0019927 A1

  • Earliest priority - 18 Apr 2001
  • Filed - 25 Feb 2003
  • Granted -
  • Expected expiry -

THIS IS NOT A GRANTED
PATENT

Title - Polynucleotides and polypeptides in plants

Claim 1

A transgenic plant comprising a recombinant polynucleotide having a polynucleotide sequence, or a complementary polynucleotide sequence thereof, selected from the group consisting of: 
    (a) a polynucleotide sequence encoding a polypeptide, wherein said polypeptide is selected from the group consisting of SEQ ID NO: 2N, wherein N=1-229, SEQ ID NO: 467; 488-490; 501-503; 505; 512-515; 521-522; 525-526; 528; 530; 534-537; 540; 558-559; 569; 587-594; 597; 607-609; 621-626; 635-639; 665-669; 708-713; 720-721; 736-739; 742; 780-807; 824; 835-837; 851-854; 865-867; 890-891; 903-907; 910-913; 922-923; 926; 933-934; 943; 953-960; 966-967; 987-988;994; 1011; 1035-1042; 1064-1073; 1081-1090; 1105-1110; 1122; 1129-1133; 1139-1141; 1157-1158; 1176-1186; 1191; 1200-1201; 1221-1248; 1254-1257; 1263; 1270; 1288-1291; 1302; 1310-1314; 1324-1327; 1338-1339; 1342-1343; 1362-1364; 1378; 1391-1392; 1395; 1399-1418; 1433; 1453-1454; 1457-1459; 1466-1467; 1492-1498; 1500-1501; 1503-1504; 1522; 1528; 1533-1535; 1540-1541; 1563-1566; 1572; 1583-1586; 1593-1594; 1621-1624; 1645-1646; 1655-1658; 1670; 1674; 1681; 1687; 1701-1705; 1710-1713; 1727; 1735-1737; 1743; 1754-1756; 1761-1762; 1765; 1769; 1781; 1785; 1790; 1805; 1813; 1838-1846; 1857; 1863; 1874-1875; 1883-1884; 1897-1901; 1911-1912; 1917-1920; 1929-1930; 1937-1939; 1942-1943; and SEQ ID NO: 2N, wherein N=974-1101;
    (b) a polynucleotide sequence encoding a polypeptide, wherein said polynucleotide sequence is selected from the group consisting of SEQ ID NO: 2N-1, wherein N=1-229, SEQ ID NO: 459-466;468-487; 491-500; 504; 506-511; 516-520; 523-524; 527; 529; 531-533; 538-539; 541-557; 560-568; 570-586; 595-596; 598-606; 610-620; 627-634; 640-664; 670-707; 714-719; 722-735; 740-741; 743-779; 808-823; 825-834; 838-850; 855-864; 868-889; 892-902; 908-909; 914-921; 924-925; 927-932; 935-942; 944-952; 961-965; 968-986; 989-993; 995-1010; 1012-1034; 1043-1063; 1074-1080; 1091-1104; 1111-1121; 1123-1128; 1134-1138; 1142-1156; 1159-1175; 1187-1190; 1192-1199; 1202-1220; 1249-1253; 1258-1262; 1264-1269; 1271-1287; 1292-1301; 1303-1309; 1315-1323; 1328-1337; 1340-1341; 1344-1361; 1365-1377; 1379-1390; 1393-1394; 1396-1398; 1419-1432; 1434-1452; 1455-1456; 1460-1465; 1468-1491; 1499; 1502; 1505-1521; 1523-1527; 1529-1532; 1536-1539; 1542-1562; 1567-1571; 1573-1582; 1587-1592; 1595-1620; 1625-1644; 1647-1654; 1659-1669; 1671-1673; 1675-1680; 1682-1686; 1688-1700; 1706-1709; 1714-1726; 1728-1734; 1738-1742; 1744-1753; 1757-1760; 1763-1764; 1766-1768; 1770-1780; 1782-1784; 1786-1789; 1791-1804; 1806-1812; 1814-1837; 1847-1856; 1858-1862; 1864-1873; 1876-1882; 1885-1896; 1902-1910; 1913-1916; 1921-1928; 1931-1936; 1940-1941; 1944-1946, and SEQ ID NO: 2N-1, wherein N=974-1101;
    (c) a polynucleotide sequence encoding the polypeptide sequence of (a) with conservative substitutions as defined in Table 2, wherein said polypeptide sequence of (a) with conservative substitutions is a transcription factor;
    (d) a variant of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said sequence variant encodes a polypeptide that is a transcription factor;
    (e) an orthologous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said orthologous sequence encodes a polypeptide;
    (f) a paralogous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said paralogous sequence encodes a polypeptide;
    (g) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that exhibits at least 80% sequence homology with the conserved domain of the polypeptide of (a), wherein said polypeptide comprising a conserved domain of a transcription factor; and wherein said conserved domain of (a) is bounded by amino acid residue coordinates according to Table 5; and
    (h) a polynucleotide that hybridizes to the polynucleotide of (a) or (b) under stringent conditions.

Claim 92

An expression cassette comprising:
    (1) a constitutive, inducible, or tissue-specific promoter; and
    (2) a recombinant polynucleotide having a polynucleotide sequence, or a complementary polynucleotide sequence thereof, selected from the group consisting of:
        (a) a polynucleotide sequence encoding a polypeptide, wherein said polypeptide is selected from the group consisting of SEQ ID NO: 2N, wherein N=1-229, SEQ ID NO: 467; 488-490; 501-503; 505; 512-515; 521-522; 525-526; 528; 530; 534-537; 540; 558-559; 569; 587-594; 597; 607-609; 621-626; 635-639; 665-669; 708-713; 720-721; 736-739; 742; 780-807; 824; 835-837; 851-854; 865-867; 890-891; 903-907; 910-913; 922-923; 926; 933-934; 943; 953-960; 966-967;987-988;994; 1011; 1035-1042; 1064-1073; 1081-1090; 1105-1110; 1122; 1129-1133; 1139-1141; 1157-1158; 1176-1186; 1191; 1200-1201; 1221-1248; 1254-1257; 1263; 1270; 1288-1291; 1302; 1310-1314; 1324-1327; 1338-1339; 1342-1343; 1362-1364; 1378; 1391-1392; 1395; 1399-1418; 1433; 1453-1454; 1457-1459; 1466-1467; 1492-1498; 1500-1501; 1503-1504; 1522; 1528; 1533-1535; 1540-1541; 1563-1566; 1572; 1583-1586; 1593-1594; 1621-1624; 1645-1646; 1655-1658; 1670; 1674; 1681; 1687; 1701-1705; 1710-1713; 1727; 1735-1737; 1743; 1754-1756; 1761-1762; 1765; 1769; 1781; 1785; 1790; 1805; 1813; 1838-1846; 1857; 1863; 1874-1875; 1883-1884; 1897-1901; 1911-1912; 1917-1920; 1929-1930; 1937-1939; 1942-1943; and SEQ ID NO: 2N, wherein N=974-1101;
        (b) a polynucleotide sequence encoding a polypeptide, wherein said polynucleotide sequence is selected from the group consisting of SEQ ID NO: 2N-1, wherein N=1-229, SEQ ID NO: 459-466; 468-487; 491-500; 504; 506-511; 516-520; 523-524; 527; 529; 531-533; 538-539; 541-557; 560-568; 570-586; 595-596; 598-606; 610-620; 627-634; 640-664; 670-707; 714-719; 722-735; 740-741; 743-779; 808-823; 825-834; 838-850; 855-864; 868-889; 892-902; 908-909; 914-921; 924-925; 927-932; 935-942; 944-952; 961-965; 968-986; 989-993; 995-1010; 1012-1034; 1043-1063; 1074-1080; 1091-1104; 1111-1121; 1123-1128; 1134-1138; 1142-1156; 1159-1175; 1187-1190; 1192-1199; 1202-1220; 1249-1253; 1258-1262; 1264-1269; 1271-1287; 1292-1301; 1303-1309; 1315-1323; 1328-1337; 1340-1341; 1344-1361; 1365-1377; 1379-1390; 1393-1394; 1396-1398; 1419-1432; 1434-1452; 1455-1456; 1460-1465; 1468-1491; 1499; 1502; 1505-1521; 1523-1527; 1529-1532; 1536-1539; 1542-1562; 1567-1571; 1573-1582; 1587-1592; 1595-1620; 1625-1644; 1647-1654; 1659-1669; 1671-1673; 1675-1680; 1682-1686; 1688-1700; 1706-1709; 1714-1726; 1728-1734; 1738-1742; 1744-1753; 1757-1760; 1763-1764; 1766-1768; 1770-1780; 1782-1784; 1786-1789; 1791-1804; 1806-1812; 1814-1837; 1847-1856; 1858-1862; 1864-1873; 1876-1882; 1885-1896; 1902-1910; 1913-1916; 1921-1928; 1931-1936; 1940-1941; 1944-1946, and SEQ ID NO: 2N-1, wherein N=974-1101;
        (c) a polynucleotide sequence encoding the polypeptide sequence of (a) with conservative substitutions as defined in Table 2, wherein said polypeptide sequence of (a) with conservative substitutions is a transcription factor;
        (d) a variant of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said sequence variant encodes a polypeptide that is a transcription factor;
        (e) an orthologous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said orthologous sequence encodes a polypeptide that is a transcription factor;
        (f) a paralogous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said paralogous sequence encodes a polypeptide that is a transcription factor;
        (g) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that exhibits at least 80% sequence homology with the conserved domain of the polypeptide of (a), wherein said polypeptide comprising a conserved domain of a transcription factor; and wherein said conserved domain of (a) is bounded by amino acid residue coordinates according to Table 5; and
        (h) a polynucleotide that hybridizes to the polynucleotide of (a) or (b) under stringent conditions; wherein said recombinant polynucleotide is operably linked to said promoter.

Claim 94

 A method for producing a modified plant having a modified trait, the method comprising:
    (a) selecting a polynucleotide that encodes a polypeptide, wherein said polynucleotide has a sequence, or a complementary sequence thereof, selected from the group consisting of:
        (i) a polynucleotide sequence encoding a polypeptide, wherein said polypeptide is selected from the group consisting of SEQ ID NO: 2N, wherein N=1-229, SEQ ID NO: 467; 488-490; 501-503; 505; 512-515; 521-522; 525-526; 528; 530; 534-537; 540; 558-559; 569; 587-594; 597; 607-609; 621-626; 635-639; 665-669; 708-713; 720-721; 736-739; 742; 780-807; 824; 835-837; 851-854; 865-867; 890-891; 903-907; 910-913; 922-923; 926; 933-934; 943; 953-960; 966-967; 987-988; 994; 1011; 1035-1042; 1064-1073; 1081-1090; 1105-1110; 1122; 1129-1133; 1139-1141; 1157-1158; 1176-1186; 1191; 1200-1201; 1221-1248; 1254-1257; 1263; 1270; 1288-1291; 1302; 1310-1314; 1324-1327; 1338-1339; 1342-1343; 1362-1364; 1378; 1391-1392; 1395; 1399-1418; 1433; 1453-1454; 1457-1459; 1466-1467; 1492-1498; 1500-1501; 1503-1504; 1522; 1528; 1533-1535; 1540-1541; 1563-1566; 1572; 1583-1586; 1593-1594; 1621-1624; 1645-1646; 1655-1658; 1670; 1674; 1681; 1687; 1701-1705; 1710-1713; 1727; 1735-1737; 1743; 1754-1756; 1761-1762; 1765; 1769; 1781; 1785; 1790; 1805; 1813; 1838-1846; 1857; 1863; 1874-1875; 1883-1884; 1897-1901; 1911-1912; 1917-1920; 1929-1930; 1937-1939; 1942-1943; and SEQ ID NO: 2N, wherein N=974-1101;
        (ii) a polynucleotide sequence encoding a polypeptide, wherein said polynucleotide sequence is selected from the group consisting of SEQ ID NO: 2N-1, wherein N=1-229, SEQ ID NO: 459-466; 468-487; 491-500; 504; 506-511; 516-520; 523-524; 527; 529; 531-533; 538-539; 541-557; 560-568; 570-586; 595-596; 598-606; 610-620; 627-634; 640-664; 670-707; 714-719; 722-735; 740-741; 743-779; 808-823; 825-834; 838-850; 855-864; 868-889; 892-902; 908-909; 914-921; 924-925; 927-932; 935-942; 944-952; 961-965; 968-986; 989-993; 995-1010; 1012-1034; 1043-1063; 1074-1080; 1091-1104; 1111-1121; 1123-1128; 1134-1138; 1142-1156; 1159-1175; 1187-1190; 1192-1199; 1202-1220; 1249-1253; 1258-1262; 1264-1269; 1271-1287; 1292-1301; 1303-1309; 1315-1323; 1328-1337; 1340-1341; 1344-1361; 1365-1377; 1379-1390; 1393-1394; 1396-1398; 1419-1432; 1434-1452; 1455-1456; 1460-1465; 1468-1491; 1499; 1502; 1505-1521; 1523-1527; 1529-1532; 1536-1539; 1542-1562; 1567-1571; 1573-1582; 1587-1592; 1595-1620; 1625-1644; 1647-1654; 1659-1669; 1671-1673; 1675-1680; 1682-1686; 1688-1700; 1706-1709; 1714-1726; 1728-1734; 1738-1742; 1744-1753; 1757-1760; 1763-1764; 1766-1768; 1770-1780; 1782-1784; 1786-1789; 1791-1804; 1806-1812; 1814-1837; 1847-1856; 1858-1862; 1864-1873; 1876-1882; 1885-1896; 1902-1910; 1913-1916; 1921-1928; 1931-1936; 1940-1941; 1944-1946, and SEQ ID NO: 2N-1, wherein N=974-1101;
        (iii) a polynucleotide sequence encoding the polypeptide sequence of (i) with conservative substitutions as defined in Table 2, wherein said polypeptide sequence of (i) with conservative substitutions is a transcription factor;
        (iv) a variant of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (i) or (ii), and wherein said sequence variant encodes a polypeptide that is a transcription factor;
        (v) an orthologous sequence of the polynucleotide sequences of (i) or (ii), which is at least 80% identical to a sequence of (i) or (ii), and wherein said orthologous sequence encodes a polypeptide that is a transcription factor;
        (vi) a paralogous sequence of the polynucleotide sequences of (i) or (ii), which is at least 80% identical to a sequence of (i) or (ii), and wherein said paralogous sequence encodes a polypeptide that is a transcription factor;
        (vii) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that exhibits at least 80% sequence homology with the conserved domain of the polypeptide of (i), wherein said polypeptide comprising a conserved domain of a transcription factor; and wherein said conserved domain of (i) is bounded by amino acid residue coordinates according to Table 5, and
        (viii) a polynucleotide that hybridizes to the polynucleotide of (i) or (ii) under stringent conditions;
    (b) inserting the polynucleotide into an expression cassette according to claim 92;
    (c) introducing the expression cassette into a plant or a cell of a plant to overexpress the polypeptide, thereby producing said modified plant; and (d) selecting said modified plant having said modified trait. 

The claims are generally drawn to:

  • A transgenic plant comprising a recombinant polynucleotide having a sequence, or a complementary sequence thereof, selected from the group of:
    • (a) a polynucleotide sequence encoding a polypeptide, selected from the group consisting of SEQ ID NO: ...(long list)
    • (b) a polynucleotide sequence encoding a polypeptide, selected from the group consisting of SEQ ID NO: ... (long list)
    • (c) a polynucleotide sequence encoding the polypeptide sequence of (a) with conservative substitutions as defined in Table 2, wherein said polypeptide sequence of (a) with conservative substitutions is a transcription factor;
    • (d) a variant of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said sequence variant encodes a polypeptide that is a transcription factor;
    • (e) an orthologous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said orthologous sequence encodes a polypeptide;
    • (f) a paralogous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said paralogous sequence encodes a polypeptide;
    • (g) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that exhibits at least 80% sequence homology with the conserved domain of the polypeptide of (a), wherein said polypeptide comprising a conserved domain of a transcription factor; and wherein said conserved domain of (a) is bounded by amino acid residue coordinates according to Table 5; and
    • (h) a polynucleotide that hybridizes to the polynucleotide of (a) or (b) under stringent conditions.

A transgenic plant comprising the SEQ IDs listed above, including polynucleotides as in (a), transcription factors with conservative substitutions from (a), transcription factors at least 80% identical to (a) or (b), orthologous or paralogs at least 80% identical to (a) or (b), polynucleotides containing conserved domains at least 80% identical to conserved domains in (a) or (b), and polynucleotides that hybridise to those in (a) or (b) under stringent conditions.

  • An expression cassette comprising:
    • (1) a constitutive, inducible, or tissue-specific promoter; and
    • (2) a recombinant polynucleotide having a polynucleotide sequence, or a complementary polynucleotide sequence thereof, selected from the group consisting of:
      • (a) a polynucleotide sequence encoding a polypeptide, selected from the group of SEQ ID NO: ...(long list)
      • (b) a polynucleotide sequence encoding a polypeptide, selected from the group of SEQ ID NO: ...(long list)
      • (c) a polynucleotide sequence encoding the polypeptide sequence of (a) with conservative substitutions as defined in Table 2, wherein said polypeptide sequence of (a) with conservative substitutions is a transcription factor;
      • (d) a variant of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said sequence variant encodes a polypeptide that is a transcription factor;
      • (e) an orthologous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said orthologous sequence encodes a polypeptide that is a transcription factor;
      • (f) a paralogous sequence of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (a) or (b), and wherein said paralogous sequence encodes a polypeptide that is a transcription factor;
      • (g) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that exhibits at least 80% sequence homology with the conserved domain of the polypeptide of (a), wherein said polypeptide comprising a conserved domain of a transcription factor; and wherein said conserved domain of (a) is bounded by amino acid residue coordinates according to Table 5; and
      • (h) a polynucleotide that hybridizes to the polynucleotide of (a) or (b) under stringent conditions; wherein said recombinant polynucleotide is operably linked to said promoter.

An expression cassette comprising, a promoter of (1) above, and containing a polynucleotide [as described above for claim 1]

  • A method for producing a modified plant having a modified trait, the method comprising:
    • (a) selecting a polynucleotide that encodes a polypeptide, wherein said polynucleotide has a sequence, or a complementary sequence thereof, selected from the group consisting of:
      • (i) a polynucleotide sequence encoding a polypeptide, selected from the group of SEQ ID NO: ... (long list)
      • (ii) a polynucleotide sequence encoding a polypeptide, selected from the group of SEQ ID NO: ... (longlist)
      • (iii) a polynucleotide sequence encoding the polypeptide sequence of (i) with conservative substitutions as defined in Table 2, wherein said polypeptide sequence of (i) with conservative substitutions is a transcription factor;
      • (iv) a variant of the polynucleotide sequences of (a) or (b), which is at least 80% identical to a sequence of (i) or (ii), and wherein said sequence variant encodes a polypeptide that is a transcription factor;
      • (v) an orthologous sequence of the polynucleotide sequences of (i) or (ii), which is at least 80% identical to a sequence of (i) or (ii), and wherein said orthologous sequence encodes a polypeptide that is a transcription factor;
      • (vi) a paralogous sequence of the polynucleotide sequences of (i) or (ii), which is at least 80% identical to a sequence of (i) or (ii), and wherein said paralogous sequence encodes a polypeptide that is a transcription factor;
      • (vii) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that exhibits at least 80% sequence homology with the conserved domain of the polypeptide of (i), wherein said polypeptide comprising a conserved domain of a transcription factor; and wherein said conserved domain of (i) is bounded by amino acid residue coordinates according to Table 5, and
      • (viii) a polynucleotide that hybridizes to the polynucleotide of (i) or (ii) under stringent conditions; (b) inserting the polynucleotide into an expression cassette according to claim 92; (c) introducing the expression cassette into a plant or a cell of a plant to overexpress the polypeptide, thereby producing said modified plant; and (d) selecting said modified plant having said modified trait.

This time the claim is towards a method for producing a modified plant comprising a polynucleotide [as described above for claim 1]

The claims above are defined with reference to the sequences defined by SEQ ID numbers.  One interesting question here is the degree to which an examiner is able to effectively search for prior art.  It is not unreasonable to expect that the details of one or more transcription factors having at least 80% identical sequence to one or more of the SEQ IDs above may already be published in the literature or in another patent document. 

Original Applicant:

Mendel Biotechnology, Inc. , 21375 Cabot Blvd., Hayward, CA, 94545,

For information concerning this application, we would suggest contacting either:

MATTHEW KASER Mendel Biotechnology, Inc. ,
21375 Cabot Blvd., , Hayward, CA, 94545,
(correspondent named on the application)

or:

Dr. William Goure,
Vice President for Commercial Development
Mendel Biotechnology, Inc.
21375 Cabot Boulevard
Hayward, CA 94545
t- 510 259 6122
 wgoure@mendelbio.com

Remarks

Again, as with the Paradigm patent applications mentioned above, it is unlikely that this application would result in a single granted patent for all of these sequence claims.  However, through numerous additional applications (continuations, CIPs and divisionals) it is entirely possible that any of the sequences listed may evolve into separate applications and may eventually be granted as patents.  Such a process may take many years.


Other interesting Mendel Biotechnology Inc. patent documents include

Document

Title

DNA Sequences
Claimed

US2003/0093837A1

Polynucleotides for seed trait alteration

74

US2004/0101481A1

Plant gene sequences I

85

US2003/0131386A1

Stress-induced polynucleotides

111

US2003/0229915A1

Plant gene sequences II

104

US2003/0233670A1

Gene sequences and uses thereof in plants

368

WO01/35725A1

Yield-related genes

35

WO02/15675A1

Genes for modifying plant traits IV

232

US 20040019927 A1

US 2007/22495

This Mendel patent application, one of a family of, to date, over 180 patent applications filed in Australia, Brazil, Canada, Japan, Mexico, the PCT, and possibly other jurisdictions that do not report to INPADOC, relates to polynucleotides and polypeptides associated with transcription factors from A. thaliana.   Note how non-specific the utility is:  particularly, the transcription factors are those responsible for modifying phenotypes of plants.  Very broad language is used for the phenotypes in claim 1b below;  a patent examiner could rightly question whether applicant has described how any one of the sequences of part (a) of the claim relate to any one of the phenotypes in part (b) of the claim, because on its face the claim is to any plant having any characteristic listed in claim 1b and containing any sequence, regardless of source, that is somewhat like any of the sequences in claim 1a.

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2007/22495

  • Earliest priority - 17 Nov 1999
  • Filed - 30 Jun 2006
  • Granted - Pending
  • Expected expiry - N/A

Title - Transcription factors for increasing yield

Claim 1

A transgenic plant having an improved trait relative to a control plant, wherein:

    (a) the transgenic plant comprises a recombinant polynucleotide encoding a first polypeptide having a conserved domain at least 65% identical to the conserved domain of a second polypeptide selected from the group consisting of SEQ ID NO: 110, 112, 116, 120, 124, 128, 131, 135, 139, 143, 147, 151, 155, 159, 163, 167, 171, 175, 179, 183, 187, 191, 195, 199, 203, 207, 211, 215, 219, 223, 227, 231, 235, 239, 243, 247, 251, 255, 259, 263, 267, 271, 275, 280, 284, 288, 292, 296, 299, 303, 306, 309, 313, 317, 321, 325, 329, 333, 337, 341, 345, 349, 353, 357, 361, 365, 369, 373, 377, 381, 385, 389, 393, 397, 401, 404, 406, 409, 413, 416, 419, 422, 425, 428, 431, 435, 439, 443, 447, 451, 454, 458, 462, 465, 468, 471, 475, 478, 482, 485, 489, 493, 497, 501, 505, 509, 512, 515, 519, 522, 526, 530, 534, 538, 542, 546, 550, 553, 557, 561, 565, 568, 571, 574, 577, 581, 585, 588, 591, 594, 597, 601, 605, 609, 613, 616, 620, 624, 628, 632, 636, 640, 644, 648, 652, 656, 660, 664, 667, 671, 674, 678, 682, 686, 689, 692, 696, 700, 704, 708, 712, 715, 719, 723, 727, 731, 734, 738, 741, 745, 749, 752, 756, 760, 762, 766, 770, 774, 778, 782, 786, 789, 793, 797, 801, 805, 809, 813, 816, 819, 823, 827, 831, 835, 839, 843, 847, 851, 855, 859, 863, 867, 871, 874, 878, 882, 886, 890, 894, 898, 901, 905, 909, 913, 917, 921, 925, 929, 933, 937, 941, 945, 949, 953, 957, 960, 963, 966, 970, 973, 976, 980, 984, 988, 992, 995, 999, 1003, 1007, 1011, 1015, 1019, 1023, 1027, 1031, 1037, 1041, 1045, 1049, 1052, 1056, 1060, 1064, 1067, 1071, 1075, 1078, 1081, 1085, 1089, 1093, 1097, 1101, 1104, 1108, 1112, 1116, 1120, 1123, 1126, 1130, 1134, 1138, 1142, 1145, 1148, 1151, 1154, 1157, 1161, 1165, 1169, 1173, 1177, 1181, 1185, 1188, 1192, 1195, 1199, 1203, 1207, 1211, 1215, 1219, 1222, 1226, 1229, 1233, 1236, 1240, 1243, 1247, 1251, 1254, 1258, 1262, 1266, 1269, 1273, 1277, 1281, 1285, 1289, 1293, 1297, 1300, 1304, 1308, 1311, 1314, 1318, 1322, 1326, 1330, 1334, 1338, 1342, 1346, 1350, 1354, 1358, 1361, 1365, 1369, 1372, 1376, 1380, 1384, 1388, 1392, 1396, 1400, 1404, 1408, 1411, 1415, 1419, 1423, 1427, 1431, 1435, 1439, 1443, 1446, 1449, 1452, 1455, 1459, 1463, 1467, 1470, 1474, 1477, 1481, 1488, 1492, 1495, 1499, 1503, 1507, 1511, 1515, 1519, 1522, 1526, 1530, 1533, 1537, 1541, 1545, 1549, 1553, 1557, 1561, 1565, 1568, 1572, 1576, 1579, 1583, 1586, 1589, 1593, 1596, 1598, 1602, 1604, 1608, 1611, 1614, 1617, 1620, 1624, 1628, 1632, 1636, 1640, 1645, 1648, 1652, 1656, 1660, 1664, 1668, 1672, 1676, 1680, 1684, 1688, 1692, 1696, 1700, 1704, 1707, 1711, 1715, 1719, 1722, 1726, 1729, 1733, 1737, 1741, 1745, 1749, 1753, 1757, 1761, 1765, 1769, 1773, 1777, 1781, 1785, 1789, 1793, 1796, 1800, 1803, 1806, 1809, 1812, 1816, 1820, 1824, 1827, 1831, 1835, 1838, 1841, 1844, 1846, 1850, 1853, 1857, 1861, 1865, 1869, 1873, 1877, 1881, 1885, 1889, 1893, 1897, 1901, 1904, 1908, 1912, 1916, 1920, 1924, 1928, 1932, 1935, 1939, 1943, 1949, 1957, 1961, 1964, 1967, 1970, 1973, 1977, 1981, 1984, 1986, 1988, 1990, 1992, 1994, 1996, 1998; and 1999-2007;

    (b) the improved trait is selected from the group consisting of larger size, larger seeds, greater yield, darker green color, increased rate of photosynthesis, more tolerance to osmotic stress, more drought tolerance, more heat tolerance, more salt tolerance, more cold tolerance, more tolerance to low nitrogen, early flowering, delayed flowering, more resistance to disease, more seed protein, and more seed oil relative to the control plant.

Claim 8

A method for producing a transgenic plant having larger size, greater yield, higher photosynthetic rate or delayed flowering relative to a control plant, the method comprising the step of transforming a target plant with an expression vector comprising a recombinant polynucleotide encoding a polypeptide having a conserved domain at least 91% identical to the conserved domain of amino acids 146-194 of SEQ ID NO: 1796, thus producing the transgenic plant.

Claim 15

A transgenic plant comprising a recombinant polynucleotide encoding a conserved domain of amino acids 146-194 of SEQ ID NO: 1796.

Mendel Biotechnology, Inc.

Remarks

This application is a Continuation-in-part of application No. 09/713994, filed on 16 Nov 2000, now abandoned. 

Patent or Publication No.

Title, Independent Claims and Summary

Assignee

US 6835540

  • Earliest priority - 16 Mar 2001
  • Filed - 16 Mar 2001
  • Granted - 28 Dec 2004
  • Expected expiry - 16 Mar 2021

Title - Biosynthetic pathway transcription factors

Claim 1

A method of determining whether a member of a pool of cloned test transcription factor polynucleotides encodes a plant pathway transcription factor, the method comprising: collecting a pool of cloned test transcription factor polynucleotides; introducing into a plant cell a nucleic acid comprising a plant promoter of a pathway gene operably linked to a reporter gene; introducing into the plant cell a member of the pool of cloned test transcription factor polynucleotides, wherein said member is selected on the basis of structural similarity to a known transcription factor for a pathway gene; and detecting expression of said reporter gene in the plant cell, thereby determining whether the member of the cloned test transcription factor polynucleotide pool encodes a plant pathway transcription factor.

Claim 15

A method of determining whether a member of a pool of test transcription factor polynucleotides encodes a biosynthetic pathway transcription factor, comprising: PA1 introducing into a plant cell nucleic acids comprising the test transcription factor polynucleotides and detecting expression of a biosynthetic pathway gene in the plant cell by quantitation of the biosynthetic pathway gene RNA level, wherein said member is selected on the basis of structural similarity to a known transcription factor for a pathway gene; PA1 thereby determining whether the member of the cloned test transcription factor polynucleotide pool encodes a plant pathway transcription factor.

Claim 16

A method of determining whether two or more members of a pool of cloned test transcription factor polynucleotides are required for expression from a pathway gene promoter, wherein said two or more members of a pool of cloned test transcription factor polynucleotides are selected on the basis of structural similarity to a known transcription factor for a pathway gene, the method comprising: PA1 collecting a pool of cloned test transcription factor polynucleotides; PA1 introducing into plant cell a nucleic acid comprising a biosynthetic pathway gene promoter operably linked to a reporter gene; PA1 introducing into plant cell the pool of cloned test transcription factor polynucleotides; PA1 detecting expression from said biosynthetic pathway gene promoter in the plant cell, and deconvoluting the pool of cloned test transcription factor polynucleotides to identify the minimum number of cloned test transcription factor polynucleotides necessary to detect expression from said biosynthetic pathway gene promoter; PA1 thereby determining whether two or more members of the cloned test transcription factor polynucleotide pool are required for expression from said biosynthetic pathway gene promoter.

Mendel Biotechnology Inc.

US 6946586

Patent or Publication No.

Title, Independent Claims and Summary

Assignee

US 6946586 B1

  • Earliest priority - 7 Apr 1999
  • Filed - 6 Apr 2000
  • Granted - 20 Sep 2005
  • Expected expiry - 6 Apr 2020

Title - Genetic trait breeding method

Claim 1

A method for screening for overexpression of a transcription factor polynucleotide in plants, said method comprising (a) providing a first pool of donor vectors, wherein each donor vector member comprises a LexA DNA binding domain fused to a GAL4 transcription activation domain, and a second pool of receptor vectors, wherein each receptor vector member comprises a LexA DNA binding site operably linked to a transcription factor polynucleotide; (b) transforming a first plant with a member of said donor vector pool; (c) transforming a second plant with a member of said receptor vector pool; (d) crossing said first and second transformed plants to generate a hybrid plant, wherein the hybrid plant comprises the member of said donor vector pool and the member of said receptor vector pool; and (e) characterizing the phenotype of said hybrid plant so as to identify a hybrid plant overexpressing the transcription factor polynucleotide.

Claim 8

A method for breeding a plant with enhanced tolerance to environmental stress, said method comprising the steps of (a) providing a first transformed plant, wherein said first transformed plant is transformed with a donor vector comprising a LexA DNA binding domain fused to a GAL4 transcription activation domain; (b) providing a second transformed plant, wherein said second transformed plant is transformed with a receptor vector comprising a LexA DNA binding site operably linked to a polynucleotide encoding a transcription factor that confers enhanced tolerance to environmental stress when overexpressed; (c) crossing said first and second transformed plants to generate a hybrid plant, wherein the hybrid plant comprises the donor vector and the receptor vector; and (d) selecting a hybrid plant that overexpresses the transcription factor, wherein the selected hybrid plant has enhanced tolerance to environmental stress.

Claim 16

A method for increasing expression of a transcription factor polynucleotide in a plant, said method comprising (a) providing a first pool of donor vectors, wherein each donor vector member comprises a LexA DNA binding domain fused to a GAL4 transcription activation domain, and a second pool of receptor vectors, wherein each receptor vector member comprises a LexA DNA binding site operably linked to a transcription factor polynucleotide; (b) transforming a first plant with a member of said donor pool; (c) transforming a second plant with a member of said receptor pool; and (d) crossing first and second transformed plants to generate a hybrid plant with increased expression of a transcription factor polynucleotide.

Mendel Biotechnology Inc.

US 7193129

Patent or Publication No.

Title, Independent Claims and Summary

Assigne

US 7193129 B2

  • Earliest priority - 18 Apr 2001
  • Filed - 9 Aug 2002
  • Granted - 20 Mar 2007
  • Expected expiry - 24 Jun 2022

Title - Stress-related polynucleotides and polypeptides in plants

Claim 1

A method for increasing the tolerance of a plant to nitrogen limiting conditions as compared to a wild-type plant of the same species, the method steps comprising (a) inserting a recombinant polynucleotide having a nucleotide sequence encoding a polypeptide comprising a conserved domain with at least about 80% sequence identity to a conserved domain of amino acid coordinates 39–76 of SEQ ID NO: 52 into an expression vector, wherein the conserved domain is a DNA-binding domain;(b) introducing the expression vector into a target plant or plant cell to generate at least one transformed plant, wherein the polypeptide is overexpressed in the at least one transformed plant and said overexpression of the polypeptide results in the at least one transformed plant having greater tolerance to the nitrogen limiting conditions as compared to the wild-type plant; and(c) from the at least one transformed plant, identifying a transgenic plant having greater tolerance to the nitrogen limiting conditions than the wild-type plant as a result of the overexpression of the polypeptide in the transgenic plant.

Claim 7

A method for producing a transgenic plant having greater tolerance to nitrogen limiting conditions than a wild-type plant of the same species, the method steps comprising (a) inserting a recombinant polynucleotide having a nucleotide sequence encoding a polypeptide comprising a conserved domain with at least about 80% sequence identity to a conserved domain of amino acid coordinates 39–76 of SEQ ID NO: 52 into an expression vector, wherein the conserved domain is a DNA-binding domain;(b) introducing the expression vector into a target plant or plant cell to generate at least one transformed plant, wherein the polypeptide is overexpressed in the at least one transformed plant and said overexpression of the polypeptide results in the at least one transformed plant having greater tolerance to the nitrogen limiting conditions as compared to the wild-type plant; and(c) from the at least one transformed plant, identifying a transgenic plant having greater tolerance to the nitrogen limiting conditions than the wild-type plant as a result of the overexpression of the polypeptide in the transgenic plant.

Mendel Biotechnology Inc.

US 7196245

Patent or Publication No.

Title, Independent Claims and Summary

Assigne

US 7196245 B2

  • Earliest priority - 18 Sep 2002
  • Filed - 18 Sep 2003
  • Granted - 27 Mar 2007
  • Expected expiry - 16 Mar 2024

Title - Polynucleotides and polypeptides that confer increased biomass and tolerance to cold, water deprivation and low nitrogen to plants

Claim 1

A transgenic plant transformed with an expression vector comprising a polynucleotide sequence encoding a polypeptide having a conserved domain that has at least 70% sequence identity to the conserved domain of amino acid coordinates 111–164 of SEQ ID NO: 194, wherein the conserved domain is a WRKY DNA-binding domain and expression of the polynucleotide in said transgenic plant results in said plant having a greater tolerance to water deprivation than a control plant.

Claim 4

A method for producing a transgenic plant having greater tolerance to water deprivation than a control plant, the method steps comprising (a) providing an expression vector comprising (i) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that has at least 70% sequence identity to the conserved domain of amino acid coordinates 111–164 of SEQ ID NO: 194, wherein the conserved domain is a WRKY DNA-binding domain and expression of the polynucleotide in said transgenic plant results in said plant having a greater tolerance to water deprivation than a control plant; and(ii) at least one regulatory element operably linked to the polynucleotide sequence, wherein said at least one regulatory element controls expression of the polynucleotide sequence in a target plant;(b) introducing the expression vector into at least one plant; and(c) selecting at least one transgenic plant that has greater tolerance to water deprivation than the control plant.

Claim 12

A method for increasing the tolerance of a plant to water deprivation, the method steps comprising (a) providing an expression vector comprising (i) a polynucleotide sequence encoding a polypeptide comprising a conserved domain that has at least 70% sequence identity to the conserved domain of amino acid coordinates 111–164 of SEQ ID NO: 194, wherein the conserved domain is a WRKY DNA-binding domain and expression of the polynucleotide in the transgenic plant results in said plant having a greater tolerance to water deprivation than a control plant; and(ii) at least one regulatory element flanking the polynucleotide sequence, wherein said at least one regulatory element controls expression of the polynucleotide sequence in a target plant;(b) introducing the expression vector into a plant, thereby producing a transgenic plant; and(c) selecting a transgenic plant having greater tolerance to water deprivation than a control plant.

Mendel Biotechnology Inc.

US 7223904

Patent or Publication No.

Title, Independent Claims and Summary

Assigne

US 7223904 B2

  • Earliest priority - 18 Feb 1999
  • Filed - 22 Nov 2002
  • Granted - 29 May 2007
  • Expected expiry - 15 Jul 2020

Title - Plant gene sequences II   

Claim 1

A transgenic plant transformed with a polynucleotide comprising a nucleotide sequence; wherein the nucleotide sequence encodes a polypeptide having at least 90% amino acid sequence identity to SEQ ID NO: 162, the polypeptide is a member of the Myb protein family, and the polypeptide comprises a Myb domain;wherein the polypeptide regulates expression of at least one gene;and wherein when the polypeptide is expressed in the transgenic plant, said expression results in the transgenic plant having greater tolerance to heat as compared to a wild-type plant of the same species.

Mendel Biotechnology Inc.

Patent claiming Transcription Factors from Arabidopsis thaliana

The following is an example of one of the granted patents for transcription factors from Mendel Biotechnology Inc.  As previously noted, the patent is part of a much larger family of documents containing applications claiming many Arabidopsis sequences, many of which are still pending in many countries:

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 6664446

  • Earliest priority - 23 Mar 1999
  • Filed - 22 Mar 2000
  • Granted - 16 Dec 2003
  • Expected expiry - 22 Mar 2020

Title - Transgenic plants comprising polynucleotides encoding transcription factors that confer disease tolerance

Claim 1

A transgenic plant comprising a recombinant polynucleotide encoding SEQ ID NO: 18, and said transgenic plant has enhanced tolerance to fungal disease due to expression of SEQ ID NO: 18.

Claim 7

A method for enhancing the disease tolerance or resistance of a plant comprising transforming a plant with a recombinant polynucleotide encoding SEQ ID NO: 18, and said transgenic plant has enhanced tolerance to fungal disease due to expression of SEQ ID NO: 18.

Claim 13

A method for altering the expression levels of at least one gene in a plant comprising transforming the plant with a recombinant polynucleotide encoding SEQ ID NO: 18, and said transgenic plant has enhanced tolerance to fungal disease due to expression of SEQ ID NO: 18.

Original Applicant:

Mendel Biotechnology, Inc., Hayward, CA., USA

Remarks

SEQ ID NO 18 is a transcriptional activator (AtERF-1; ethylene responsive element binding factor 1). Note that the claims issued in this patent are far narrower than those of the patent application originally filed, which was discussed above.  The narrowing of claims probably reflects the existence of sequences that had already been disclosed which would have been infringing the overly broad claims of the patent application.   Note, however, that claims to all the other sequences originally claimed could still be pending.

The scope of this claim is limited to comprising language around the peptide sequence disclosed in SEQ ID NO 18.  It may be possible to achieve the same or similar phenotye modifications that expression of AtERF-1 would cause in plants by the use of a homologous sequence from elsewhere in the same genome or from another species.  However, it is not truly clear how much there is freedom to operate.  How far does the doctrine of equivalents provide protection in this case?  Would an AtERF-1 homologue less than 90% identical at the amino acid level be a work around in this case?  Also, is "tolerance" the same as disease resistance?

Note also that fungal disease is specifically mentioned in the claims of the issued patent.  However, this does NOT imply that the applicants could not obtain claims over bacterial or viral diseases or any of the other phenotypes mentioned in the applications still pending. 

This patent application is related to nucleotide sequences encoding plant transcription factors from soybean, rice, maize, barley, wheat and other plants as well as Arabidopsis thaliana.  Like the transcription factor patent application shown before, it is part of a family of over 180 patent applications filed in Australia, Brazil, Canada, Europe, Japan, the USA and the PCT as well as possibly in other jurisdictions that do not report to INPADOC.

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2007/33671 A1

  • Earliest priority - 18 Sep 2002
  • Filed - 18 Sep 2003
  • Granted - Pending
  • Expected expiry - N/A

PAIR Link

Title - Polynucleotides and polypeptides in plants

Claim 1

A recombinant polynucleotide selected from the group consisting of:

    (a) a nucleotide sequence encoding a polypeptide, wherein said nucleotide sequence is selected from the group consisting of SEQ ID NO: 2N-1, where N=1-335, SEQ ID NOs: 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 684, 685, 686, 687, 688, 689, 691, 692, 693, 694, 695, 696, 697, 698, 699, 701, 702, 704, 705, 706, 708, 709, 710, 711, 712, 713, 714, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 764, 765, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796, 797, 798, 799, 800, 801, 802, 803, 804, 805, 806, 812, 813, 814, 815, 816, 817, 818, 819, 820, 821, 822, 823, 824, 825, 826, 827, 828, 829, 834, 835, 836, 837, 838, 839, 840, 841, 842, 843, 844, 853, 854, 855, 856, 857, 858, 859, 860, 861, 862, 863, 864, 866, 867, 868, 869, 870, 871, 874, 875, 876, 877, 878, 879, 880, 881, 882, 883, 884, 888, 889, 890, 891, 892, 893, 894, 895, 896, 897, 899, 900, 901, 902, 903, 904, 905, 906, 907, 909, 910, 911, 912, 913, 914, 915, 916, 917, 918, 920, 921, 922, 923, 924, 926, 927, 928, 929, 930, 931, 938, 939, 940, 941, 942, 944, 945, 946, 947, 948, 949, 950, 951, 952, 953, 954, 955, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 972, 973, 974, 975, 976, 977, 980, 981, 982, 983, 984, 985, 986, 987, 988, 989, 990, 991, 992, 993, 994, 995, 996, 997, 998, 999, 1000, 1005, 1006, 1007, 1008, 1009, 1014, 1015, 1016, 1017, 1018, 1019, 1020, 1021, 1022, 1025, 1026, 1027, 1028, 1029, 1030, 1031, 1033, 1034, 1035, 1036, 1037, 1038, 1039, 1043, 1044, 1045, 1046, 1047, 1048, 1049, 1051, 1052, 1053, 1055, 1056, 1057, 1058, 1059, 1060, 1061, 1062, 1063, 1066, 1067, 1068, 1069, 1070, 1071, 1072, 1073, 1080, 1081, 1082, 1083, 1084, 1085, 1088, 1089, 1090, 1091, 1094, 1095, 1096, 1097, 1098, 1099, 1100, 1101, 1102, 1103, 1104, 1105, 1106, 1107, 1108, 1109, 1110, 1111, 1113, 1114, 1115, 1116, 1117, 1118, 1119, 1120, 1121, 1122, 1123, 1124, 1125, 1126, 1127, 1128, 1129, 1130, 1135, 1136, 1137, 1138, 1139, 1140, 1141, 1142, 1143, 1144, 1146, 1147, 1148, 1149, 1153, 1154, 1155, 1156, 1157, 1158, 1159, 1160, 1161, 1162, 1163, 1164, 1165, 1166, 1167, 1168, 1169, 1172, 1173, 1174, 1175, 1176, 1177, 1178, 1179, 1180, 1181, 1182, 1183, 1187, 1188, 1189, 1190, 1191, 1192, 1193, 1196, 1199, 1200, 1201, 1203, 1204, 1205, 1206, 1207, 1209, 1210, 1211, 1212, 1213, 1214, 1215, 1216, 1217, 1218, 1222, 1223, 1224, 1225, 1226, 1227, 1228, 1229, 1230, 1231, 1234, 1235, 1236, 1237, 1238, 1239, 1240, 1242, 1243, 1244, 1245, 1246, 1247, 1248, 1249, 1250, 1251, 1255, 1256, 1257, 1258, 1259, 1260, 1261, 1262, 1263, 1264, 1265, 1266, 1267, 1268, 1269, 1270, 1271, 1277, 1278, 1280, 1281, 1282, 1283, 1284, 1285, 1286, 1293, 1294, 1295, 1296, 1297, 1298, 1299, 1300, 1302, 1303, 1304, 1305, 1306, 1307, 1310, 1311, 1312, 1313, 1314, 1316, 1318, 1319, 1320, 1321, 1322, 1323, 1324, 1325, 1326, 1327, 1328, 1329, 1330, 1335, 1336, 1337, 1338, 1339, 1340, 1341, 1342, 1343, 1344, 1345, 1346, 1347, 1348, 1349, 1351, 1353, 1355, 1357, 1359, 1361, 1363, 1365, 1367, 1369, 1371, 1373, 1375, 1377, 1379, 1381, 1383, 1385, 1387, 1389, 1391, 1393, 1395, 1397, 1399, 1401, 1403, 1405, 1407, 1409, 1411, 1413, 1415, 1417, 1419, 1421, 1423, 1425, 1427, 1429, 1431, 1433, 1435, 1437, 1439, 1441, 1443, 1445, 1447, 1449, 1451, 1453, 1455, 1457, 1459, 1461, 1463, 1465, 1467, 1469, 1471, 1473, 1475, 1477, 1479, 1481, 1483, 1485, 1487, 1489, 1491, 1493, 1495, 1497, 1499, 1501, 1503, 1505, 1507, 1509, 1511, 1513, 1515, 1517, 1519, 1521, 1523, 1525, 1527, 1529, 1531, 1533, 1535, 1537, 1539, 1541, 1543, 1545, 1547, 1549, 1551, 1553, 1555, 1557, 1558, 1559, 1560, 1561, 1562, 1563, 1564, 1565, 1566, 1567, 1568, 1569, 1570, 1571, 1572, 1573, 1574, 1575, 1576, 1577, 1578, 1579, 1580, 1581, 1582, 1583, 1584, 1585, 1586, 1587, 1588, 1589, 1590, 1591, 1592, 1593, 1594, 1595, 1596, 1597, 1598, 1599, 1600, 1601, 1602, 1603, 1604, 1605, 1606, 1607, 1608, 1609, 1610, 1611, 1612, 1613, 1614, 1615, 1616, 1617, 1618, 1619, 1620, 1621, 1622, 1623, 1624, 1625, 1626, 1627, 1628, 1629, 1630, 1631, 1632, 1633, 1634, 1635, 1636, 1637, 1638, 1639, 1640, 1641, 1642, 1643, 1644, 1645, 1646, 1647, 1648, 1649, 1650, 1651, 1652, 1653, 1654, 1655, 1656, 1657, 1658, 1659, 1660, 1661, 1662, 1663, 1664, 1665, 1666, 1667, 1668, 1669, 1670, 1671, 1672, 1673, 1674, 1675, 1676, 1677, 1678, 1679, 1680, 1681, 1682, 1683, 1684, 1685, 1686, 1687, 1688, 1689, 1690, 1691, 1692, 1693, 1694, 1695, 1696, 1697, 1698, 1699, 1700, 1701, 1702, 1703, 1704, 1705, 1706, 1707, 1708, 1709, 1710, 1711, 1712, 1713, 1714, 1715, 1716, 1717, 1718, 1719, 1720, 1721, 1722, 1723, 1724, 1725, 1726, 1727, 1728, 1729, 1730, 1731, 1732, 1733, 1734, 1735, 1736, 1737, 1738, 1739, 1740, 1741, 1742, 1743, 1744, 1745, 1746, 1747, 1748, 1749, 1750, 1751, 1752, 1753, 1754, 1755, 1756, 1757, 1758, 1759, 1760, 1761, 1762, 1763, 1764, 1765, 1766, 1767, 1768, 1769, 1770, 1771, 1772, 1773, 1774, 1775, 1776, 1777, 1778, 1779, 1780, 1781, 1782, 1783, 1784, 1785, 1786, 1787, 1788, 1789, 1790, 1791, 1792, 1793, 1794, 1795, 1796, 1797, 1798, 1799, 1800, 1801, 1802, 1803, 1804, 1805, 1806, 1807, 1808, 1809, 1810, 1811, 1812, 1813, 1814, 1815, 1816, 1817, 1818, 1819, 1820, 1821, 1822, 1823, 1824, 1825, 1826, 1827, 1828, 1829, 1830, 1831, 1832, 1833, 1834, 1835, 1836, 1837, 1838, 1839, 1840, 1841, 1842, 1843, 1844, 1845, 1846, 1847, 1848, 1849, 1850, 1851, 1852, 1853, 1854, 1855, 1856, 1857, 1858, 1859, 1860, 1861, 1862, 1863, 1864, 1865, 1866, 1867, 1868, 1869, 1870, 1871, 1872, 1873, 1874, 1875, 1876, 1877, 1878, 1879, 1880, 1881, 1882, 1883, 1884, 1885, 1886, 1887, 1888, 1889, 1890, 1891, 1892, 1893, 1894, 1895, 1896, 1897, 1898, 1899, 1900, 1901, 1902, 1903, 1904, 1905, 1906, 1907, 1908, 1909, 1910, 1911, 1912, 1913, 1914, 1915, 1916, 1917, 1918, 1919, 1920, 1921, 1922, 1923, 1924, 1925, 1926, 1927, 1928, 1929, 1930, 1931, 1932, 1933, 1934, 1935, 1936, 1937, 1938, 1939, 1940, 1941, 1942, 1943, 1944, 1945, 1946, 1947, 1948, 1949, 1950, 1951, 1952, 1953, 1954, 1955, 1956, 1957, 1958, 1959, 1960, 1961, 1962, 1963, 1964, 1965, 1966, 1967, 1968, 1969, 1970, 1971, 1972, 1973, 1974, 1975, 1976, 1977, 1978, 1979, 1980, 1981, 1982, 1983, 1984, 1985, 1986, 1987, 1988, 1989, 1990, 1991, 1992, 1993, 1994, 1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025, 2026, 2027, 2028, 2029, 2030, 2031, 2032, 2033, 2034, 2035, 2036, 2037, 2038, 2039, 2040, 2041, 2042, 2043, 2044, 2045, 2046, 2047, 2048, 2049, 2050, 2051, 2052, 2053, 2054, 2055, 2056, 2057, 2058, 2059, 2060, 2061, 2062, 2063, 2064, 2065, 2066, 2067, 2068, 2069, 2070, 2071, 2072, 2073, 2074, 2075, 2076, 2077, 2078, 2079, 2080, 2081, 2082, 2083, 2084, 2085, 2086, 2087, 2088, 2089, 2090, 2091, 2092, 2093, 2094, 2095, 2096, 2097, 2098, 2099, 2100, 2101, 2102, 2103, 2104, 2105, 2106, 2107, 2108, 2109, 2110, 2111, 2112, 2113, 2114, 2115, 2116, 2117, 2118, 2119, 2120, 2121, 2122, 2123;

    (b) a nucleotide sequence encoding a polypeptide, wherein said polypeptide is selected from the group consisting of SEQ ID NO: 2N, where N=1-335, SEQ ID NOs: 681, 682, 683, 690, 700, 703, 707, 715, 729, 730, 731, 743, 760, 761, 762, 763, 766, 767, 782, 783, 784, 785, 807, 808, 809, 810, 811, 830, 831, 832, 833, 845, 846, 847, 848, 849, 850, 851, 852, 865, 872, 873, 885, 886, 887, 898, 908, 919, 925, 932, 933, 934, 935, 936, 937, 943, 956, 957, 958, 959, 970, 971, 978, 979, 1001, 1002, 1003, 1004, 1010, 1011, 1012, 1013, 1023, 1024, 1032, 1040, 1041, 1042, 1050, 1054, 1064, 1065, 1074, 1075, 1076, 1077, 1078, 1079, 1086, 1087, 1092, 1093, 1112, 1131, 1132, 1133, 1134, 1145, 1150, 1151, 1152, 1170, 1171, 1184, 1185, 1186, 1194, 1195, 1197, 1198, 1202, 1208, 1219, 1220, 1221, 1232, 1233, 1241, 1252, 1253, 1254, 1272, 1273, 1274, 1275, 1276, 1279, 1287, 1288, 1289, 1290, 1291, 1292, 1301, 1308, 1309, 1315, 1317, 1331, 1332, 1333, 1334, 1350, 1352, 1354, 1356, 1358, 1360, 1362, 1364, 1366, 1368, 1370, 1372, 1374, 1376, 1378, 1380, 1382, 1384, 1386, 1388, 1390, 1392, 1394, 1396, 1398, 1400, 1402, 1404, 1406, 1408, 1410, 1412, 1414, 1416, 1418, 1420, 1422, 1424, 1426, 1428, 1430, 1432, 1434, 1436, 1438, 1440, 1442, 1444, 1446, 1448, 1450, 1452, 1454, 1456, 1458, 1460, 1462, 1464, 1466, 1468, 1470, 1472, 1474, 1476, 1478, 1480, 1482, 1484, 1486, 1488, 1490, 1492, 1494, 1496, 1498, 1500, 1502, 1504, 1506, 1508, 1510, 1512, 1514, 1516, 1518, 1520, 1522, 1524, 1526, 1528, 1530, 1532, 1534, 1536, 1538, 1540, 1542, 1544, 1546, 1548, 1550, 1552, 1554, and 1556;

    (c) a variant of any of the nucleotide sequences of (a) or (b) that has at least 70% sequence identity to a sequence of (a) or (b);

    (d) an orthologous sequence of any of the nucleotide sequences of (a) or (b) that has at least 70% identity to a sequence of (a) or (b);

    (e) a paralogous sequence of any of the nucleotide sequences of (a) or (b) that has at least 70% identity to a sequence of (a) or (b);

    (f) a nucleotide sequence that hybridizes to any of the nucleotide sequence of (a) or (b) under stringent conditions comprising 6×SSC and 65° C. in two wash steps of 10-30 minutes;

    (g) a nucleotide sequence encoding a polypeptide comprising a conserved domain that has at least 70% sequence homology with a conserved domain of a polypeptide encoded by any of the nucleotide sequences of (a)-(f), wherein said conserved domain is required for the function of the polypeptide encoded by any of the nucleotide sequences of (a)-(f) in regulating transcription and altering a trait in a transgenic plant.

Mendel Biotechnology, Inc.

Remarks

A corresponding patent (US 7196245) has been issued, in which claims on only two sequences were granted for a WRKY DNA-binding domain. This was also filed in Australia, Brazil, Canada, Japan, Mexico and the PCT.

The following are US patents that have issued from the above-described Mendel patent applications on "Polynucleotides and Polypeptides from Plants". Mendel, and probably its research ally Monsanto, has the right to exclude anyone from using technology covered by the claims below in the US or in products to be imported into the US. Note that while the claims are narrower than those in the corresponding patent application, this is no cause for feeling relieved about how many research avenues they leave open for development into deliverable products.

For example, in the patent 6,717,034 shown below, the scope of the claims is towards transgenic plants having increased biomass (compared to a control), and the methods involved in producing them.  The scope is further limited with respect to the sequence used to obtain this phenotype.  Specifically the DNA sequence in SEQ ID 1 is used, along with the peptide sequence in SEQ ID 2, to define the scope in plants. Claims include DNA sequences (and complementary sequences) that code for SEQ ID NO: 2 (including SEQ ID NO 1), and sequences that "bind under stringent conditions" to SEQ ID 1 (and complementary sequences). Such claim language opens the possibility of using other transcription factors that might result in similar useful phenotypes (e.g. would the equivalent transcription factor from another dicot, that codes for a peptide sequence other than SEQ ID NO 2, and is sufficiently different to SEQ ID NO 1, be a possible work-around for this patent?)  However, before making such a conclusion it would be important to examine the prosecution history, i.e. the interactions between the patent office and the applicant, about how the doctrine of equivalents would need to be taken into account).  

Also, this patent is one of a large INPADOC family, and broader claims to the same sequences, as well as to all the other sequences in the application, could still be pending.  This means that at any time patents with broader claims could still issue from any of the over 100 patent applications that are still pending.

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 6717034

  • Earliest priority - 30 Mar 2001
  • Filed - 30 Mar 2001
  • Granted - 6 Apr 2004
  • Expected expiry - 30 Mar 2021

Title - Method for modifying plant biomass

Claim 1

A transgenic plant comprising
    a recombinant polynucleotide comprising:
         (a) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2, or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO; 2; or
        (b) a nucleotide sequence comprising SEQ ID NO: 1, or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1;

wherein expression of the recombinant polynucleotide comprising either (a) or (b) increases the plant's biomass as compared to a control plant not transformed with said recombinant polynucleotide.

Claim 3

A transgenic plant comprising
     a recombinant polynucleotide comprising
          a polynucleotide sequence that hybridizes over its full length under stringent conditions to:
              (a) a nucleotide sequence comprising SEQ ID NO: 1, or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1; or
              (b) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2, or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2;

     wherein the stringent conditions comprise wash conditions of 0.2×SSC to 2.0×SSC, 0.1% SDS at 60-65° C. and
          wherein expression of the polynucleotide sequence that hybridizes to either (a) or (b) increases the plant's biomass as compared to a control plant not transformed with said recombinant polynucleotide.

Claim 5

A method for producing a plant having increased biomass; said method comprising:
     (a) providing an expression vector or cassette comprising a polynucleotide sequence selected from the group consisting of:
          (i) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2 or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2;
          (ii) a nucleotide sequence comprising SEQ ID NO: 1 or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1; and
          (iii) a nucleotide sequence that hybridizes under stringent conditions to a nucleotide sequence of (i) or (ii),

     wherein the stringent conditions comprise wash conditions of 0.2×SSC to 2.0×SSC, 0.1% SDS at 60-65° C.;
          wherein expression of the nucleotide sequence of (iii) increases the plant's biomass as compared to a control plant not transformed with said expression vector or cassette; and

     (b) transforming a plant with the expression vector or cassette, thereby producing a plant that expresses the nucleotide sequence of (i), (ii) or (iii), said plant having increased plant biomass as compared to a control plant not transformed with the expression vector or cassette.

Claim 6

A transgenic plant comprising
     a recombinant polynucleotide comprising a polynucleotide sequence that hybridizes over its full length under stringent conditions to:
          (a) a nucleotide sequence comprising SEQ ID NO: 1, or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1; or
          (b) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2, or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2; wherein the stringent conditions comprise wash conditions of 0.2×SSC, 0.1% SDS at 65° C., and

     wherein expression of the polynucleotide sequence that hybridizes to either (a) or (b) increases the plant's biomass as compared to a control plant not transformed with said recombinant polynucleotide.

Claim 7

A method for producing a plant having increased biomass; said method comprising:
     (a) providing an expression vector or cassette comprising a polynucleotide sequence selected from the group consisting of:
          (i) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2 or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2;
          (ii) a nucleotide sequence comprising SEQ ID NO: 1 or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1; and
          (iii) a nucleotide sequence that hybridizes under stringent conditions to a nucleotide sequence of (i) or (ii),

     wherein the stringent conditions comprise wash conditions of 0.2×SSC, 0.1% SDS, at 65° C.;
          wherein expression of the nucleotide sequence of (iii) increases the plant's biomass as compared to a control plant not transformed with said expression vector or cassette; and (b) transforming a plant with the expression vector or cassette, thereby producing a plant that expresses the nucleotide sequence of(i), (ii) or (iii), said plant having increased plant biomass as compared to a control plant not transformed with the expression vector or cassette.


Original Applicant:

Mendel Biotechnology, Inc., Hayward, CA., USA

Patent or Publication No.

Title, Independent Claims and Summary

Assignee

US 7135616

  • Earliest priority - 18 Apr 2001
  • Filed - 09 Aug 2002
  • Granted - 14 Nov 2006
  • Expected expiry - 10 Apr 2021

Title - Biochemistry-related polynucleotides and polypeptides in plants

Claim 1

A method for producing a transgenic plant having increased tolerance to osmotic stress or drought relative to a wild-type plant of the same species, the method steps comprising:

(a) producing an expression vector comprising a nucleotide sequence encoding a polypeptide comprising a conserved domain with at least 80% sequence identity to amino acid coordinates 11–80 of SEQ ID NO: 66; and

(b) introducing the expression vector into the plant to produce a transgenic plant;

wherein the polypeptide is overexpressed in the transgenic plant and said overexpression results in the transgenic plant having increased tolerance to the osmotic stress or drought.

Claim 11

A method for producing a transgenic plant having delayed flowering relative to a wild-type plant of the same species, the method steps comprising:

(a) producing an expression vector comprising a nucleotide sequence encoding a polypeptide comprising a conserved domain with at least 80% sequence identity to amino acid coordinates 11–80 of SEQ ID NO: 66; and

(b) introducing the expression vector into the plant to produce a transgenic plant;

wherein the polypeptide is overexpressed in the transgenic plant and said overexpression results in the transgenic plant having the delayed flowering.

Claim 21

A method for producing a transgenic plant having increased lignin content relative to a wild-type plant of the same species, the method steps comprising:

(a) producing an expression vector comprising a nucleotide sequence encoding a polypeptide comprising a conserved domain with at least 80% sequence identity to amino acid coordinates 11–80 of SEQ ID NO: 66; and

(b) introducing the expression vector into the plant to produce a transgenic plant;

wherein the polypeptide is overexpressed in the transgenic plant and said overexpression results in the transgenic plant having increased lignification relative to the wild-type plant.

Mendel Biotechnology Inc.

Summary

Mendel's interest in transcription factors of Arabidopsis are clearly demonstrated in their patenting strategy.  Like Paradigm Genetics, Mendel's early patent applications contain claims to large groups of sequences.  However, unlike Paradigm, Mendel's patenting activities have extended beyond the initial bulk sequence claims.  A large and complex patent family has arrisen from a small number of initial patent applications.  Unlike Paradigm's applications, the '927 application discussed above is the result of a series of continuations of previous applications.  The '927 application has itself led to still further continuation applications.  Many of the Mendel applications are still active, and even those that have abandoned or expired have led to further applications. Unlike Paradigm, Mendel has granted patents for some transcription factors and methods relating to transcription factors (e.g.US 6664446 , US 6835540, and US 6717034).

Mendel has invested significant resources in developing this patent document family. It is likely that Mendel will continue this strategy to eventually obtain patents on more transcription factors from Arabidopsis.  The language of the granted claims is crucial in determining whether claims over other dicot species will be possible.  The recent granted patent, US 6717034 (towards a transcription factor involved in modifying plant biomass), has the following claim language:

Claim #3: A transgenic plant comprising a recombinant polynucleotide comprising a polynucleotide sequence that hybridizes over its full length under stringent conditions to: (a) a nucleotide sequence comprising SEQ ID NO: 1, or a sequence that is fully complementary to the nucleotide sequence comprising SEQ ID NO: 1; or (b) a nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2, or a sequence that is fully complementary to the nucleotide sequence encoding a polypeptide comprising SEQ ID NO: 2; wherein the stringent conditions comprise wash conditions of 0.2×SSC to 2.0×SSC, 0.1% SDS at 60-65° C. and wherein expression of the polynucleotide sequence that hybridizes to either (a) or (b) increases the plant's biomass as compared to a control plant not transformed with said recombinant polynucleotide.

Such language is more limiting in scope than that of the '927 patent.  However, it is not unreasonable to expect that such hybridization conditions may be met by at least homologs from other species within the mustard family. Whether-or-not homologs from other important crop species meet the same criteria cannot easily be deterimined, and would require experimental analysis. In part, this uncertainty evolves from the use of hybridization language in the claims.  Such claims, based on hybridization conditions, are difficult to quantify and the huge cost of possible litigation may act as a barrier to those interested in developing IP around transcription factors in other dicots.

To demonstrate the complexity of the patent document family that Mendel has created, consider the following diagram (the '927 application family):

Slide1

(Hexagons: represent pending applications; Diamonds: represent adandoned or expired applications; F = filing date; CIP = continuation-in-part application)

Broad claims in Applications:
Arguebly, it is the role of a good patent attorney to provide the best possible outcome for a client.  It is probably for this reason that the scope of initial claims in a patent application is often as wide as possible.  Such broad intent is demonstrated in the text of the '927 application, where the following is stated:

Sequences homologous, i.e., that share significant sequence identity or similarity, to those provided in the Sequence Listing (except CBF sequences SEQ ID NOs: 1955-1960), derived from Arabidopsis thaliana or from other plants of choice, are also an aspect of the invention. Homologous sequences can be derived from any plant including monocots and dicots and in particular agriculturally important plant species, including but not limited to, crops such as soybean, wheat, corn (maize), potato, cotton, rice, rape, oilseed rape (including canola), sunflower, alfalfa, clover, sugarcane, and turf; or fruits and vegetables, such as banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, pineapple, pumpkin, spinach, squash, sweet corn, tobacco, tomato, tomatillo, watermelon, rosaceous fruits (such as apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, cauliflower, Brussels sprouts, and kohlrabi). Other crops, including fruits and vegetables, whose phenotype can be changed and which comprise homologous sequences include barley; rye; millet; sorghum; currant; avocado; citrus fruits such as oranges, lemons, grapefruit and tangerines, artichoke, cherries; nuts such as the walnut and peanut; endive; leek; roots such as arrowroot, beet, cassaya, turnip, radish, yam, and sweet potato; and beans. The homologous sequences may also be derived from woody species, such pine, poplar and eucalyptus, or mint or other labiates. In addition, homologous sequences may be derived from plants that are evolutionarily-related to crop plants, but which may not have yet been used as crop plants. Examples include deadly nightshade (Atropa belladona), related to tomato; jimson weed (Datura strommium), related to peyote; and teosinte (Zea species), related to corn (maize).

The importance of using hybridisation and sequence similarity language to broaden the scope of claims can be inferred from the following two figures from the '927 application.  These figures relate the evolutionary position of Arabidopsis to other important crops.  One may suspect that those crops most closely related to Arabidopsis would be most likely to have genes with highly homologous sequences (and thus satisfy the hybridisation and/or similarity ranges stated in the claims):

Figure 1 figure 2

Paradigm Genetics Inc. (Icoria Inc.  and Monsanto Co.)

Paradigm Genetics Inc. (Research Triangle Park, N.C., US) was founded in 1997 and went public in 2000. In its early years Paradigm's business statements emphasised making use of genomics information obtained from Arabidopsis, for extensions into crop plants via agricultural biotechnology.  During this time it sequenced a large number of Arabidopsis ESTs.

In 2004 Paradigm changed its name to Icoria Inc.  Icoria moved into medical biotechnology and sold assets relating to its agricultural genomics business to Monsanto Co.  In the period 1997-2004 Monsanto had extensive ties to the Arabidopsis genome sequencing project, and extensive bulk sequence patent filings in crop plants (e.g. rice, maize and cotton), although because of non-publication requests by Monsanto and other procedural matters, many of these were not known to the public until 2004 or later (see for example US 2004/31072, which claims 285,684 sequences mostly from soybean, with applications in many other related crop plants).

We have been able to find 9 patent applications with Paradigm Genetics Inc. as the applicant or correspondent.  Since the 2004 acquisition these may now be assigned to Monsanto, although USPTO assignment information provided by the applicants is not always kept up to date.  

Since all 9 patents are very similar, and each claim ~1000 ESTs from Arabidopsis, we have chosen one, US2001/44940 A1 (the '940 application) to discuss below.

See bar graphs showing the portions of the Arabidopsis genome that are covered by patent applications and granted patents that are assigned to Paradigm Genetics Inc.

Search Strategy

Since the major players in patenting of Arabidopsis sequences are known (from press releases and biotechnology publications), company names can be used in searches.  In this case Paradigm Genetics was used as part of a search strategy.

ThParadigm Genetics Inc. patents discussed  in this section were identified using the following search strategy.

Search details

Date of search

18 05 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(arabidopsis AND thaliana) AND (paradigm AND genetics)

Results

361

Comments

Of the 361 results identified using these search terms, 9 results were identified as having bulk sequence claims based on the content of their titles, abstracts, and claims.  The correspondent named on these applications is PARADIGM GENETICS, INC., 104 ALEXANDER DRIVE, BUILDING 2, P O BOX 14528, RTP, NC, 277094528.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

As a precaution, a further search was performed:  this time no assignee/applicant/correspondent names were used.  Instead the strategy conducted was:

Search details

Date of search

14 06 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims)

Results

3,824

Comments

The results were sorted by relevance score, and the top 300 patent documents were investigated.  Whilst new and interesting Arabidopsis patents appeared, no-other Paradigm applications or patents covering bulk sequence applications could be identified (although some may have been missed by this search).

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

Patent applications for Arabidopsis bulk sequences  made by Paradigm Genetics Inc. (Icoria Inc. - Monsanto Co.)

The following is an example of one of the 9 patent applications by Paradigm Genetics Inc (now Icoria Inc.). The '940 A1 application has the earliest filing date of the 9 applications. All 9 applications are listed as "abandoned" on the USPTO website database.  However, one must remember that "abandoned" does not necessarily mean that Icoria or now Monsanto Co. have actually given up their attempts at patenting all or part of the claims. It may still be possible that future continuations, continuations-in-part, or divisional applications might appear as patent document publications.

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2001/44940 A1

  • Earliest priority - 27 Jan 2000
  • Filed - 26 Jan 2001
  • Granted -Abandoned
  • Expected expiry -N/A

PAIR link

Title - Expressed sequences of Arabidopsis thaliana

Claim 1

A nucleic acid comprising
    a sequence capable of hybridizing under stringent conditions to a sequence set forth in SEQ ID NO:1 to 911,
        or a fragment thereof.

Claim 5

A nucleic acid comprising:
    an ATG start codon;
    an optional intervening sequence;
    a coding sequence capable of hybridizing under stringent conditions as set forth in SEQ ID NO:1 to 911;
    and an optional terminal sequence, wherein
        at least one of said optional sequences is present, and wherein:
            ATG is a start codon;
            said intervening sequence comprises one or more codons in-frame with said coding sequence, and is free of in-frame stop codons;
            and said terminal sequence comprises one or more codons in-frame with said coding sequence, and a terminal stop codon.

Claim 14

A transgenic plant comprising
    an exogenous nucleic acid, wherein
        said nucleic acid comprises
            transcription regulatory sequences operably linked to a sequence capable of hybridizing under stringent conditions to a sequence set forth in SEQ ID NO:1 to 911 or a fragment thereof, wherein
                said sequence is expressed in cells of said plant.

Claim 24

A genetically modified cell, comprising
    an exogenous nucleic acid, wherein
        said nucleic acid comprises
            transcription regulatory sequences operably linked to a sequence capable of hybridizing under stringent conditions to a sequence set forth in SEQ ID NO:1 to 911, wherein
                said sequence is expressed in cells of said plant.

Claim 26

A nucleic acid array comprising
    at least one nucleic acid as set forth in SEQ ID NO:1-911 stably bound to a solid support.

Claim 27

An array comprising
    at least one polypeptide encoded by a nucleic acid as set forth in SEQ ID NO:1-911, stably bound to a solid support.

The claims are generally drawn to:

  • A nucleic acid comprising
    • a sequence capable of hybridizing under stringent conditions to a sequence set forth in SEQ ID NO:1 to 911,
    • or a fragment thereof.

Essentially, this claims the ESTs themselves, any homologues, allellic variants,..etc. and may cover the equivalent gene in other organisms. The "comprising" language in this claim makes the claim's scope so broad that there almost certainly will be existing prior art.  Stringent hybridisation conditions are defined as "50°C or higher and 0.1×SSC".  "A fragment thereof" is not clear in itself and could imply a 2bp sequence! The application arguably provides the following guidance "Preferably, hybridization is performed using at least 15 contiguous nucleotides...".

  • A nucleic acid comprising:
    • an ATG start codon;
    • an optional intervening sequence;
    • a coding sequence capable of hybridizing under stringent conditions as set forth in SEQ ID NO:1 to 911;  and
    • an optional terminal sequence, wherein
      • at least one of the optional sequences is present, and
      • wherein: ATG is a start codon;
      •  the intervening sequence comprises
        • one or more codons in-frame with the coding sequence, and is free of in-frame stop codons; and
      • the terminal sequence comprises one or more codons in-frame with the coding sequence, and
      • a terminal stop codon.

This essentially claims DNA constructs or vectors containing the ESTs, or part of the ESTs, or their homologues, and any sequences from other organisms that satisfies the stringency conditions above (since no organism limits are made here!).  It is hard to imagine that none of these sequences would hybridize to DNA from other organisms (even human sequences!). How can an examiner be certain that any of these sequence claims are not invalidated by prior art (particularly other bulk sequence claims for rice and maize ESTs??!)

  • A transgenic plant comprising
    • an exogenous nucleic acid, wherein said nucleic acid comprises
      • transcription regulatory sequences operably linked to a sequence capable of hybridizing under stringent conditions to a sequence set forth in SEQ ID NO:1 to 911 or a fragment thereof, wherein said sequence is expressed in cells of said plant.

This time the claim is towards a transgenic plant transformed with the claimed DNA. Arguably, the "operably linked" phrase may allow a work around of at least part of this claim: by the use of transactivation technology (see PCT publication WO 01/21781).

  • A genetically modified cell, comprising
    • an exogenous nucleic acid, wherein said nucleic acid comprises
      • transcription regulatory sequences operably linked to a sequence capable of hybridizing under stringent conditions to a sequence set forth in SEQ ID NO:1 to 911, wherein said sequence is expressed in cells of said plant.

Similar to the previous claim, for cells transformed with the claimed DNA. Again, consider transactivation technology and PCT publication WO 01/21781. Interestingly, the qualifier "plant" is not used to describe the cell. Does this imply that this claim covers animal, fungal, and protist cells, whereas the previous did not cover transgenic animals?

  • A nucleic acid array comprising
    • at least one nucleic acid as set forth in SEQ ID NO:1-911 stably bound to a solid support.

Attempts to cover use in microarray technologies. What does stably bound mean? What if the bond is reversible?

  • An array comprising
    • at least one polypeptide encoded by a nucleic acid as set forth in SEQ ID NO:1-911, stably bound to a solid support.

As above for DNA arrays, in this case for peptide arrays.  However, no mention is made of peptide length - this could mean anything from full length peptides to 2-amino acids! (There is no specification for this in the application)

Original Applicant:

PARADIGM GENETICS, INC.,
104 ALEXANDER DRIVE,
BUILDING 2,
PO BOX 14528, RTP,
NC, 277094528.

Which recently became:

Icoria
108 Alexander Drive
P. O. Box 14528
Research Triangle Park
North Carolina 27709-4528
Phone: 919-425-3000
Fax: 919-544-8094)

Monsanto has since acquired selected agricultural genomics assets from Icoria:

Monsanto Company
800 North Lindbergh Boulevard
St. Louis, MO-63167
(314) 694-1000

We would suggest that interested parties should try to contact:

Charles W Burson  Executive Vice President, General Counsel, and Secretary
Monsanto Company
800 North Lindbergh Boulevard
St. Louis, Missouri 63167-7843

Telephone: 314-694-1000

Remarks

Given the broad scope of the claims above, it is not expected that Paradigm's 9 bulk sequence applications would produce a granted patent over all the sequences listed.  Instead Paradigm's goal is probably to establish a priority date for all the sequences, and the opportunity to select from this group sequences that may become valuable and can be patented in divisional, CIPs, and continuation applications in the future. 

This unfortunately introduces uncertainty for those working on these or related genes.  Since it is virtually impossible to tell which genes will be the focus of future active applications.  It is also possible that future efforts by others to study the function of these genes may produce results that influence's Paradigm's/Icoria's/Monsanto's decision on which gene(s) to continue patenting efforts.  Future research that shows gene 'X' to be useful (valuable) might be sufficient to initiate further applications for that gene in divisional applications.


The remaining 8 applications are:

Application No.

Document Title

Sequences Claimed

US 2002/23280

Expressed sequences of arabidopsis thaliana

999

US 2002/23281

Expressed sequences of arabidopsis thaliana

999

US 2002/40489

Expressed sequences of arabidopsis thaliana

999

US 2002/40490

Expressed sequences of arabidopsis thaliana

999

US 2002/59663

Expressed sequences of arabidopsis thaliana

999

US 2002/62014

Expressed sequences of arabidopsis thaliana

999

US 2002/142319

Expressed sequences of arabidopsis thaliana

900

US 2003/115639

Expressed sequences of arabidopsis thaliana

999


The 8 remaining applications are:

Publication No.

Application No.

Filing Date

Publication Date

Document Title

Sequences Claimed

US 2002/23280

60/178,502

26 Jan 2001

21 Feb 2002

Expressed sequences of arabidopsis thaliana

999

US 2002/23281

60/178,472

26 Jan 2001

21 Feb 2002

Expressed sequences of arabidopsis thaliana

999

US 2002/40489

60/178,503

26 Jan 2001

04 Apr 2002

Expressed sequences of arabidopsis thaliana

999

US 2002/40490

60/178,512

26 Jan 2001

04 Apr 2002

Expressed sequences of arabidopsis thaliana

999

US 2002/59663

60/178,506

26 Jan 2001

16 May 2002

Expressed sequences of arabidopsis thaliana

999

US 2002/62014

60/178,480

26 Jan 2001

23 May 2002

Expressed sequences of arabidopsis thaliana

999

US 2002/142319

60/148,784

07 Aug 2001

03 Oct 2002

Expressed sequences of arabidopsis thaliana

900

US 2003/115639

60/178,466

26 Jan 2001

19 Jun 2003

Expressed sequences of arabidopsis thaliana

999

Summary

Paradigm Genetics Inc. was essentially a start-up company, created to make use of (and patent) DNA and EST sequences generated from Arabidopsis. By making use of the new technology of EST sequencing it could potentially gain patents over a large number of Arabidopsis genes.  Due to the language of the patent claims:

"...hybridizing under stringent conditions to a sequence set forth in..."

it is entirely likely that successful claims would "read on" large numbers of related genes from other dicot species.  In particular those that have agricultural importance: potato, soya, canola,...

All 9 patent applications of Paradigm are classified as "abandoned" in the USPTO's database PAIR.  The assignee of these applications at the USPTO is still listed as Paradigm Genetics. Note also the fact that these are US applications, and as far as we are able to tell, were not the focus of applications elsewhere in the world.

Unfortunately, although classified as "abandoned" it is still too early to say for certain that these applications are no longer "active" for the following reasons:

  1. Continuations, continuations-in-part (CIP) and divisional applications may be filed prior to abandonment (and may still be in the process of publishing)
  2. Alternatively payments can be made to the USPTO so that some "abandoned" applications can be recovered
  3. There is some scope for US applicants to request that their applications not be published immediately

It is unlikely that, in their present state, patents would be granted on these applications, since the claims appear to be very broad. However, Paradigm's applications may demonstrate one early step in a bulk sequence patenting strategy: 

Apply for a patent over as many sequences as possible and establish the earliest possible priority date.

This strategy may have the following benefits:

By creating such important potential IP, a small company such as Paradigm became potentially valuable to larger companies such as Monsanto, who have large agricultural biotechnology interests.  The decision by Paradigm to move into human health biotechnology, was almost certainly facilitated by funds generated in the sale of agricultural assets to Monsanto. In turn, Monsanto acquired potentially useful IP and the loss of a competitor in Paradigm/Icoria.

Also, Monsanto is probably in a far better position to make use of such patented technologies. In fact Monsanto has its own bulk sequence claims for maize and cotton.

As we will see in the following chapters, others have refined this patenting strategy in Arabidopsis.

Ceres Inc.

Ceres is a privately-owned, US-based, plant biotechnology company (www.ceres.net) with headquaters in Thousand Oaks, California. It makes use of genomic technologies towards developments in agriculture and human health.  Ceres specialises in high throughput screening, full length cDNA sequencing, plant breeding, and targeted gene activation. Interestingly, as with Paradigm and Mendel, Ceres has ties to Monsanto, with whom it has extensive license-based agreements.

Ceres

The company states clearly on its website that the development and protection of IP is an important goal. On the same site it claims that Ceres has filed applications on over 50,000 full length genes and 10,000 promoter sequences.  Ceres' goal of sequencing full-length cDNA sequences compared to ESTs or genomic sequences appears to arise from the need to gain composition of matter claims in US patents (and it seeks at least one substantial utility per gene sequence for patenting purposes). Ceres' genomics facilities apparently allow it to be able to investigate gene function such that descriptions of multiple utilities for each gene of interest can be obtained.

We chose to analyse a some recent applications by Ceres covering both large numbers of functional Arabidopsis sequences and also promoter sequences. Ceres' patent applications provide a more recent example of a bulk sequence application for analysis, and for comparison to the Paradigm and Mendel applications.  Analysis of one example promoter application is also useful since Ceres has interests in this technology.

Ceres' IP interests in Arabidopsis are focused towards claims on promoter sequences and full length genes with agricultural utility.

To date, Ceres has filed patent applications on over 50,000 full-length functionally annotated genes and over 10,000 promoter sequences

Intellectual Property

Ceres places a large emphasis on the generation and protection of intellectual property. Ceres' intellectual property strategy leverages three major competitive advantages: (1) the ability to discover and sequence full-length genes in a high-throughput manner, (2) an integrated suite of genomic technologies which enables Ceres to ascribe specific utilities to a very high proportion of the genes that it discovers, and (3) information management systems which enable the rapid and accurate filing and follow-up of thousands of patent applications.

Ceres practices a proprietary technology for capturing and sequencing full-length cDNAs. Unlike ESTs, these sequences cover the complete and accurate open reading frame of a gene giving the information necessary for a cell to manufacture the encoded protein. Genomic sequence can sometimes give equivalent information, but accurate predictions of gene structure from genomic sequence are only accurate in approximately 50% of the cases tested. The US Patent and Trademark Office has indicated that accurate full-length genes are necessary for the issuance of composition of matter claims that cover the production of the encoded protein. In addition, at least one substantial and practical utility must be described for a particular gene sequence. Ceres' integrated genomics engine enables the description of multiple utilities for any given gene. The combination of this utility information and the full-length gene sequences should enable Ceres to gain intellectual property protection for a large portfolio of important plant genes.

The generation and integration of such a large amount of data into accurate patent application submissions requires a sophisticated information management effort. Ceres has designed and incorporated information management technologies into its technology platforms to enable the timely and accurate generation of thousands of patent applications. To date, Ceres has filed patent applications on over 50,000 full-length functionally annotated genes and over 10,000 promoter sequences.

http://www.ceres-inc.com/about/intell.html

April 3, 2002

Monsanto and Ceres Announce Important Product Discovery and Development Collaboration

St. Louis, Missouri and Los Angeles, California -- Monsanto Company and Ceres, Inc. today announced an important product discovery and development collaboration focused on applying genomics technologies to provide improvements in certain agricultural crops.

"Our new alliance with Ceres provides us access to a proprietary knowledge base that will accelerate Monsanto's product pipeline. By marrying the complementary genomics capabilities of Ceres with the product development capabilities of Monsanto, we will be in a position to deliver additional value to farmers worldwide," said Hendrik Verfaillie, chief executive officer of Monsanto.

Walter De Logi, chief executive officer of Ceres, said, "Monsanto is an ideal partner for Ceres because it is a leader in providing biotechnology-based agricultural products to the world's farmers. Monsanto has the resources to give Ceres' technology the widest possible distribution. Furthermore, we share a vision of providing abundant food and a healthy environment around the world."

Under the collaboration, Monsanto will acquire rights to Ceres technologies in certain crops and applications in exchange for payments over several years. Ceres will receive additional payments subject to meeting specified objectives for developing additional related technology, as part of its continuing commitment to genomics-based product discovery. Monsanto will also fund a jointly implemented research program and has made a minority equity investment in Ceres. Expected payments to Ceres under the collaboration are $137 million over several years plus potential royalties.

As part of the collaboration, Monsanto and Ceres have agreed to make technologies accessible to farmers in developing countries, including non-profit humanitarian applications not served by commercial markets or research and development investments. This innovative feature of the collaboration reflects the commitment of both companies to bring the benefits of science and technology to society through public-private partnerships to improve subsistence crops, in addition to the commercial development of products around the world.

Ceres, Inc. is a privately held biotechnology company utilizing multiple integrated plant genomics technologies to develop innovative products. In addition to pursuing opportunities in the food, feed and fiber industries, Ceres' business strategy consists of leveraging its technology platform in the agrochemical, chemical and pharmaceutical industries.

Monsanto Company (NYSE: MON) is a leading provider of agricultural solutions to growers worldwide. Monsanto's employees provide top-quality, cost-effective and integrated approaches to help farmers improve their productivity and produce better quality foods. For more information on Monsanto, see: www.monsanto.com.

http://www.ceres-inc.com/PressRelease_News/04_03_02.html

Ceres Announces Genomics Milestone in Energy Crop Enhancement Program (info
12,000 full-length switchgrass genes sequenced and genetic variation characterized   

THOUSAND OAKS, CA – July 10, 2006
Contact:   Ceres, Inc.  Shirley Bell  sbell@ceres-inc.com

www.oxbio.com/pressreleases/071006_Ceres.pdf 

MONSANTO COLLABORATES WITH CERES 

Monsanto Company and the privately owned biotechnology company, Ceres Inc, Los Angeles, have announced collaboration to apply genomics technologies to provide improvements in agricultural crops. "Our new alliance with Ceres provides us access to a proprietary knowledge base that will accelerate Monsanto's product pipeline.  By marrying the complementary genomics capabilities of Ceres with the product development capabilities of Monsanto, we will be in a position to deliver additional value to farmers worldwide," said Hendrik Verfaillie, CEO of Monsanto.

Under the collaboration, Monsanto will acquire rights to technologies in certain crops and applications in exchange. Expected payments to Ceres through the deal will be $137 million over several years as well as possible royalties. Monsanto and Ceres have also agreed to make some technologies accessible to farmers in developing countries, including non-profit humanitarian applications.

Crop Protection Monthly - 30 April 2002, Issue No. 149
Published by: Market Scope Europe Ltd     
Website: http://www.crop-protection-monthly.co.uk
Editor: Brian R. Hicks
E-mail: brianralphhicks@aol.com

Search Strategy

Since the major players in patenting of Arabidopsis sequences are known, the company names could be used in searches.  In this case "Ceres" was used as part of a search strategy.

The Ceres Inc. patents discussed  in this section were identified using the following search strategy.

Search details

Date of search

18 05 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

 Arabidopsis AND (Ceres in applicant)

Results

36

Comments

Of the 36 results identified using these search terms, 2 results were identified as having bulk sequence claims based on the content of their titles, abstracts, and claims.  The applicant named on these applications is CERES, INC., Thousand Oaks, CA, US.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

Patent applications for Arabidopsis bulk sequences made by Ceres Inc.

The following is an example of a recent patent application by Ceres towards a large group of Arabidopsis sequences. Whilst there are many-more-than 100 sequences in this application (2523 sequence IDs listed), approximately half of these are for the corresponding protein sequence, and some sequences are from corn, soya bean, wheat, brassica, and others. This patent document also provides an example of a more recent bulk sequence application, than those of Paradigm or Mendel. The focus of this application is towards Arabidopsis polypeptides and the sequences encoding them,...

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2006/0057724 A1

  • Earliest priority - 30 Jun 2004
  • Filed - 30 Jun 2005
  • Granted - Pending
  • Expected expiry - N/A

Title - Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics and phenotypes

Claim 1:

An isolated nucleic acid molecule comprising:
         a) a nucleic acid having a nucleotide sequence which encodes an amino acid sequence exhibiting at least 85% sequence identity to an amino acid sequence in the Sequence Listing or in the ortholog alignments of FIG. 1;
         b) a nucleic acid which is a complement of a nucleotide sequence according to paragraph (a);
         c) a nucleic acid which is the reverse of the nucleotide sequence according to subparagraph (a), such that the reverse nucleotide sequence has a sequence order which is the reverse of the sequence order of the nucleotide sequence according to subparagraph (a); or
         d) a nucleic acid capable of hybridizing to a nucleic acid according to any one of paragraphs (a)-(c), under conditions that permit formation of a nucleic acid duplex at a temperature from about 40° C. and 48° C. below the melting temperature of the nucleic acid duplex.

Claim 9:

An isolated polypeptide comprising an amino acid sequence exhibiting at least 85% sequence identity of an amino acid sequence of the Sequence Listing or in the ortholog alignments of FIG. 1.

The claims are generally drawn to:

  • A nucleic acid molecule comprising:
    • a) a nucleic acid encoding a peptide with a minimum of 85% sequence identity to a peptide in the Sequence Listing or in FIG. 1.
    • b) a nucleic acid which is a complement of a sequence according to paragraph (a);
    • c) a nucleic acid which is the reverse of the sequence according to subparagraph (a), such that the reverse nucleotide sequence has a sequence order which is the reverse of the sequence order of the sequence according to subparagraph (a); or
    • d) a nucleic acid capable of hybridizing to the sequences of paragraphs (a)-(c), at a temperature from about 40° C. and 48° C. below the melting temperature of the nucleic acid duplex.

The 85% sequence identity at the peptide level is broad, opening the possibility of finding prior art (although this might require analysis tools that may not be available to examiners). The hybridization language used here also allows the possibility of very broad claim scope. Again opening up the possibility of finding prior ar.  The language used, and the 40° C to 48° C below Tm stipulation make it extremely difficult to do a thorough analysis of prior art. However, tools that allow such analysis might very-well lead to prior art discovery. The patent text defines the hybridization temperatures (-40 to -48° C from Tm) of this claim to be equivalent to "low stringency" conditions!

  • An isolated polypeptide comprising an amino acid sequence exhibiting at least 85% sequence identity of an amino acid sequence of the Sequence Listing or in FIG. 1.

One interpretation of this is that claims over the orthologs listed in FIG 1 are also being made, and sequences 85% identical to them.  This again broadens the scope of the claim - and may be enough to read on prior art.  Discovery of such prior art would require a thorough analysis of not only sequences that are 85% identical to the SEQ ID No., but also of the orthologs in the alignment in Fig1. Are examiners able to do this effectively, given the time and resources available to them?

Original Applicant:

CERES, INC.,  Thousand Oaks, CA (US)

For information regarding this application and possible licensing, we would suggest contacting:

Dr Peter Mascia
Ceres, Inc.
Director of Product Development
1535 Rancho Conejo Blvd.
Thousand Oaks, CA 91320
Phone: (805) 376-6500
Fax: (805) 498-1002
 pmascia@ceres-inc.com

Remarks

  The scope of claimed sequences for this Ceres application relies on "hybridization language", as does the Paradigm Genetics applications discussed previously.  Such claims scope are difficult to quantify. In this case Ceres has provided a temperature range (compared to Tm) for their hybridisation conditions.  Unfortunately, by comparing this range to the literature (and their own) definition of stringency, their definition constitutes "low stringency" conditions.  Thus their claims to Arabidopsis sequences almost certainly will cover the same or similar sequences from many dicots.

Other patent documents of interest

US 2006/15970 A1

Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics

~65 Arabidopsis sequences

Patents claiming Promoter Sequences from Arabidopsis thaliana

The following is an example of granted patents for promoter sequences from Ceres Inc. Although the application shown here contains less than 100 sequences, they are included to demonstrate Ceres' interest in Arabidopsis promoters and the relationship between the claims in granted patents and patent applications. Additionally, Ceres has many applications and patents for promoters, which together might constitute claims equivalent to a bulk sequence application.

The following 816' application does not list Ceres as the applicant or correspondent!  By performing google searches using the inventors names (shing kwok yu-ping) it was possible to infer that these inventors were employed by Ceres around the same tme that the patent application was made (the publication number 20060008816 is not present in the USPTO assignee database at the time of this analysis).  Hence the inclusion of the application below:

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2006/8816 A1

  • Earliest priority - 6 Nov 2003
  • Filed - 04 Nov 2004
  • Granted - Pending
  • Expected expiry - N/A

Title - Promoter, Promoter control elements, and combinations, and uses thereof

Claim 1

An isolated nucleic acid molecule capable of modulating transcription wherein

the nucleic acid molecule shows at least 80% sequence identity to a sequence set forth in Table 1, or
a complement thereof.

Claim 6

A vector construct comprising:

  • a) a first nucleic acid capable of modulating transcription wherein the nucleic acid molecule shows at least 80% sequence identity to a sequence set forth in Table 1; and
  • b) a second nucleic acid having to be transcribed,
  • wherein said first and second nucleic acid molecules are heterologous to each other and are operably linked together.
Claim 13

A method of modulating transcription by combining, in an environment suitable for transcription:

  • a) a first nucleic acid molecule capable of modulating transcription wherein the nucleic acid molecule shows at least 80% sequence identity to a sequence set forth in Table 1; and
  • b) a second molecule to be transcribed;
  • wherein the first and second nucleic acid molecules are heterologous to each other and operably linked together.

The claims are generally drawn to:

  • nucleic acids similar to those in table 1 (at least 80% identity) capable of modulating transcription (claim 1)
  • constructs comprising the sequences above (claim 6)

  • a method of modulating transcription by the sequences above (claim 13).

Original Applicant:


Listed inventors


Assumed, Present Assignee:

Ceres Inc.

Remarks

It is interesting to note that using the USPTO assignee database to search for "CERES" as ASSIGNEE returns patent documents that have the University of California named as applicant and that were later assigned to Ceres.  Hence using CERES in the search below is likely to miss granted patents now assigned to Ceres. Additionally there is no guarantee that patents now assigned to Ceres will have their assignments updated in the USPTOs databses!

Search Strategy Used

The patents for promoter sequences discussed in this section were identified using the following search strategy.

Search details

Date of search

08 09 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(Arabidopsis AND thaliana AND Ceres) AND (promoter in claims) AND (SEQ AND ID) 

Results

Of the 34 results obtained from this search, many had no obvious links to Ceres. Confirming the identity of Ceres as the assignee was difficult since "Ceres" as a search term uncovered documents containing "Ceres" linked to materials used in experimentation (e.g. Ceres microarrays...). For this reason we chose to discuss only the single document above to demonstrate Ceres' interests in Arabidopsis promoters. Please take note that this document was chosen because it is part of a larger document family with 5 US and one PCT applications. It is likely that the above represents only a small part of the promoter sequences included in patent documents.

Comments

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

US 2006/150283

 Publication No.

Title, Independent Claims and Summary

Applicant

US 2006/150283 A1

  • Earliest priority - 13 Feb 2004
  • Filed - 14 Feb 2005
  • Granted - Pending
  • Expected expiry - N/A

PAIR Link

Title - Sequence-determined DNA fragments and corresponding polypeptides encoded thereby

Claim 1

An isolated nucleic acid molecule comprising:

    a) a full length cDNA nucleic acid having a nucleotide sequence which encodes an amino acid sequence exhibiting at least 40% sequence identity to an amino acid sequence encoded by

        (1) a full length cDNA nucleotide sequence described in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, or a fragment thereof; or

        (2) a complement of a full-length cDNA nucleotide sequence shown in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, or a fragment thereof;

    b) a nucleic acid which is the reverse of the nucleotide sequence according to subparagraph (a), such that the reverse nucleotide sequence has a sequence order which is the reverse of the sequence order of the nucleotide sequence according to subparagraph (a);

    c) a nucleic acid capable of hybridizing to a nucleic acid having a sequence selected from the group consisting of:

        a full-length cDNA nucleotide sequence which is shown in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents; and

        a nucleotide sequence which is complementary to a full-length cDNA nucleotide sequence shown in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, under conditions that permit formation of a nucleic acid duplex at a temperature from about 40° C and 48° C below the melting temperature of the nucleic acid duplex, with the proviso that said nucleotide sequence is not any of the sequences described in the Tables of any of Patent Publication Nos. WO 200040695, CA 2300692 A1, EP 1033405 A2, CA 2302828 A1 and EP 1059354 A2 and any proteins listed in the application that are identified by gi number or otherwise as being from the non-redundant GenBank CDS translations or Protein Database (PDB), available via the internet, or (PIR-International) Database (PIR), available via the internet.

Claim 2

An isolated nucleic acid molecule comprising a nucleic acid having a nucleotide sequence which exhibits at least 65% sequence identity to

    a) a full-length cDNA nucleotide sequence shown in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, or a fragment thereof; or

    b) a complement of a full-length cDNA nucleotide sequence described in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, or a fragment thereof, with the proviso that said nucleotide sequence is not any of the sequences described in the Tables of any of Patent Publication Nos. WO 200040695, CA 2300692 A1, EP 1033405 A2, CA 2302828 A1 and EP 1059354 A2 and any proteins listed in the application that are identified by gi number or otherwise as being from the non-redundant GenBank CDS translations or Protein Database (PDB), or (PIR-International) Database (PIR).

Claim 11

An isolated polypeptide comprising an amino acid sequence

    a) exhibiting at least 40%, or 75%, or 85%, or 90% sequence identity of an amino acid sequence encoded by a sequence shown in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, or a fragment thereof; and

    b) capable of exhibiting at least one of the biological activities of the polypeptide encoded by said nucleotide sequence shown in the Seqeuence Listing or the Sequence Listing-Miscellaneous Feature documents, or a fragment thereof, with the proviso that said nucleotide sequence is not any of the sequences described in the Tables of any of Patent Publication Nos. WO 200040695, CA 2300692 A1, EP 1033405 A2, CA 2302828 A1 and EP 1059354 A2 and any proteins listed in the application that are identified by gi number or otherwise as being from the non-redundant GenBank CDS translations or Protein Database (PDB), or (PIR-International) Database (PIR).

Ceres, Inc.

Remarks

We have no information that a corresponding patent application has been filed in any other country besides the USA.

This patent application is directed to nucleotide sequences and their encoded polypeptides from soybean, wheat , maize and canola, as well as Arabidopsis thaliana. According to sequence information published in the USPTO's PSIPS database, 1032 sequences are included in this application.

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 20070039067 A1

  • Earliest priority - 30 Sep 2004
  • Filed - 30 Sep 2005
  • Granted - Pending
  • Expected expiry - N/A

Title - Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics

Claim 1

An isolated nucleic acid molecule comprising:

    a) a nucleic acid having a nucleotide sequence which encodes an amino acid sequence exhibiting at least 85% sequence identity to an amino acid sequence in Sequence Listing;

    b) a nucleic acid which is a complement of a nucleotide sequence according to paragraph (a);

    c) a nucleic acid which is the reverse of the nucleotide sequence according to subparagraph (a), such that the reverse nucleotide sequence has a sequence order which is the reverse of the sequence order of the nucleotide sequence according to subparagraph (a);

    d) a nucleic acid which is an interfering RNA to the nucleotide sequence according to subparagraph (a); or

    e) a nucleic acid capable of hybridizing to a nucleic acid according to any one of paragraphs (a)-(c), under conditions that permit formation of a nucleic acid duplex at a temperature from about 40° C and 48° C below the melting temperature of the nucleic acid duplex.

Ceres, Inc.

Remarks

US Application with missing sequence identifiers

Publication No.

Title, Independent Claims and Summary

Assignee and licensing information

US 2006/150285 A1

  • Earliest priority - 16 Dec 2004
  • Filed - 16 Dec 2005
  • Publication - 6 Jul 06

Title - Nucleotide sequences and polypeptides encoded thereby for enhancing plant drought tolerance

Claim 1

An isolated nucleic acid molecule comprising: (a) a nucleotide sequence encoding an amino acid sequence that is at least 85% identical to any one of Leads 68, 69, 94 and 95, corresponding to SEQ ID Nos. ______-______, respectively; (b) a nucleotide sequence that is complementary to any one of the nucleotide sequences according to paragraph (a); (c) a nucleotide sequence according to any one of SEQ ID Nos. ______-______; (d) a nucleotide sequence that is in reverse order of any one of the nucleotide sequences according to (c) when read in the 5′ to 3′ direction; (e) a nucleotide sequence that is an interfering RNA to the nucleotide sequence according to paragraph (a); (f) a nucleotide sequence able to form a hybridized nucleic acid duplex with the nucleic acid according to any one of paragraphs (a)-(d) at a temperature from about 40° C. to about 48° C. below a melting temperature of the hybridized nucleic acid duplex; (f) a nucleotide sequence encoding any one of the amino acid sequences identified as Leads 68, 69, 94 and 95, corresponding to SEQ ID Nos. ______, respectively; or (g) a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in FIGS. 1-4 or one of the polypeptide sequences in SEQ ID NOs. 1-93 and 173-176.

Claim 2

A vector, comprising: a) a first nucleic acid having a regulatory region encoding a plant transcription and/or translation signal; and a second nucleic acid having a nucleotide sequence according to any one the nucleotide sequences of claim 1, wherein said first and second nucleic acids are operably linked.

Claim 3

A method for increasing drought tolerance in a plant, said method comprising introducing into a plant cell an isolated nucleic acid comprising: (a) a nucleotide sequence encoding an amino acid sequence that is at least 85% identical to any one of Leads 68, 69, 94 and 95, corresponding to SEQ ID Nos. ______-______, respectively; (b) a nucleotide sequence that is complementary to any one of the nucleotide sequences according to paragraph (a); (c) a nucleotide sequence according to any one of SEQ ID Nos. ______-______; (d) a nucleotide sequence that is in reverse order of any one of the nucleotide sequences according to (c) when read in the 5′ to 3′ direction; (e) a nucleotide sequence that is an interfering RNA to the nucleotide sequence according to paragraph (a); (f) a nucleotide sequence able to form a hybridized nucleic acid duplex with the nucleic acid according to any one of paragraphs (a)-(d) at a temperature from about 40° C. to about 48° C. below a melting temperature of the hybridized nucleic acid duplex; (f) a nucleotide sequence encoding any one of the amino acid sequences identified as Leads 68, 69, 94 and 95, corresponding to the full length or the CDS identified in SEQ ID Nos. 6, 25, 57 and 71, respectively; or (g) a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in FIGS. 1-4 or one of the polypeptide sequences in SEQ ID NOs. 1-93 and 173-176, wherein said plant produced from said plant cell has increased drought tolerance as compared to the corresponding level of a control plant that does not comprise said nucleic acid.

Claim 16

A plant cell comprising an isolated nucleic acid comprising a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 85% identical to any one of Leads 68, 69, 94 and 95, corresponding to SEQ ID Nos. ______-______, respectively; (b) a nucleotide sequence that is complementary to any one of the nucleotide sequences according to paragraph (a); (c) a nucleotide sequence according to the full length sequence or CDS of any one of SEQ ID Nos. 6, 25, 57 and 71; (d) a nucleotide sequence that is in reverse order of any one of the nucleotide sequences according to (c) when read in the 5′ to 3′ direction; (e) a nucleotide sequence that is an interfering RNA to the nucleotide sequence according to paragraph (a); (f) a nucleotide sequence able to form a hybridized nucleic acid duplex with the nucleic acid according to any one of paragraphs (a)-(d) at a temperature from about 40° C. to about 48° C. below a melting temperature of the hybridized nucleic acid duplex; (f) a nucleotide sequence encoding any one of the amino acid sequences identified as Leads 68, 69, 94 and 95, corresponding to SEQ ID Nos. ______-______, respectively; or (g) a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in FIGS. 1-4.

Claim 25

A method for increasing drought tolerance in a plant, comprising: (a) transforming a plant with a nucleic acid molecule comprising a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in any one of FIGS. 1-4; and (b) expressing said nucleotide sequence in said transformed plant, whereby said transformed plant has an increased drought tolerance as compared to a plant that has not been transformed with said nucleotide sequence.

Ceres, Inc.

US 2006/195943 A1

  • Earliest priority - 8 Dec 2004
  • Filed - 8 Dec 2005
  • Publication - 31 Aug 06
Title

- Nucleotide sequences and corresponding polypeptides conferring modulated plant size and biomass in plants

Claim 1

An isolated nucleic acid molecule comprising: (a) a nucleotide sequence encoding an amino acid sequence that is at least 85% identical to any one of Leads 11, 17, 50, 58, 13/64 and 12/67, corresponding to SEQ ID Nos. 2, 10, 17, 22, 27 and 4, respectively; (b) a nucleotide sequence that is complementary to any one of the nucleotide sequences according to paragraph (a); (c) a nucleotide sequence according to any one of SEQ ID Nos. 1, 9, 16, 21, 26 and 3; (d) a nucleotide sequence that is in reverse order of any one of the nucleotide sequences according to (c) when read in the 5′ to 3′ direction; (e) a nucleotide sequence that is an interfering RNA to the nucleotide sequence according to paragraph (a); (f) a nucleotide sequence able to form a hybridized nucleic acid duplex with the nucleic acid according to any one of paragraphs (a)-(d) at a temperature from about 40° C. to about 48° C. below a melting temperature of the hybridized nucleic acid duplex; (f) a nucleotide sequence encoding any one of the amino acid sequences identified as Leads 11, 17, 50, 58, 13/64 and 12/67, corresponding to SEQ ID Nos. 2, 10, 17, 22, 27 and 4, respectively; or (g) a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in FIGS. 1-5.

Claim 3

A method of modulating plant size, modulating vegetative growth, modulating plant architecture and/or modulating the plant biomass, said method comprising introducing into a plant cell an isolated nucleic acid comprising a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 85% identical to any one of Leads 11, 17, 50, 58, 13/64 and 12/67, corresponding to SEQ ID Nos. 2, 10, 17, 22, 27 and 4, respectively; (b) a nucleotide sequence that is complementary to any one of the nucleotide sequences according to paragraph (a); (c) a nucleotide sequence according to any one of SEQ ID Nos. 1, 9, 16, 21, 26 and 3; (d) a nucleotide sequence that is in reverse order of any one of the nucleotide sequences according to (c) when read in the 5′ to 3′ direction; (e) a nucleotide sequence that is an interfering RNA to the nucleotide sequence according to paragraph (a); (f) a nucleotide sequence able to form a hybridized nucleic acid duplex with the nucleic acid according to any one of paragraphs (a)-(d) at a temperature from about 40° C. to about 48° C. below a melting temperature of the hybridized nucleic acid duplex; (f) a nucleotide sequence encoding any one of the amino acid sequences identified as Leads 11, 17, 50, 58, 13/64 and 12/67, corresponding to SEQ ID Nos. 2, 10, 17, 22, 27 and 4, respectively; or (g) a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in FIGS. 1-5, wherein said plant produced from said plant cell has modulated plant size, modulated vegetative growth, modulated plant architecture and/or modulated biomass as compared to the corresponding level in tissue of a control plant that does not comprise said nucleic acid.

Claim 15

A plant cell comprising an isolated nucleic acid comprising a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 85% identical to any one of Leads 11, 17, 50, 58, 13/64 and 12/67, corresponding to SEQ ID Nos. 2, 10, 17, 22, 27 and 4, respectively; (b) a nucleotide sequence that is complementary to any one of the nucleotide sequences according to paragraph (a); (c) a nucleotide sequence according to any one of SEQ ID Nos. 1, 9, 16, 21, 26 and 3; (d) a nucleotide sequence that is in reverse order of any one of the nucleotide sequences according to (c) when read in the 5′ to 3′ direction; (e) a nucleotide sequence that is an interfering RNA to the nucleotide sequence according to paragraph (a); (f) a nucleotide sequence able to form a hybridized nucleic acid duplex with the nucleic acid according to any one of paragraphs (a)-(d) at a temperature from about 40° C. to about 48° C. below a melting temperature of the hybridized nucleic acid duplex; (f) a nucleotide sequence encoding any one of the amino acid sequences identified as Leads 11, 17, 50, 58, 13/64 and 12/67, corresponding to SEQ ID Nos. 2, 10, 17, 22, 27 and 4, respectively; or (g) a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in FIGS. 1-5.

Claim 24
A method for promoting increased biomass in a plant, comprising: (a) transforming a plant with a nucleic acid molecule comprising a nucleotide sequence encoding any one of the lead, functional homolog or consensus sequences in any one of FIGS. 1-5; and (b) expressing said nucleotide sequence in said transformed plant, whereby said transformed plant has an increased biomass as compared to a plant that has not been transformed with said nucleotide sequence.

Remarks

The related PCT publication is WO 2006/76099 A2. We have no information that a corresponding patent application has been filed in any other country besides the USA.

Summary

Cere's stated interests in patenting of whole cDNA sequences and promoters is supported by the patent documents we found for Arabidopsis

Our analysis of bulk sequence applications focused on their 724' A1 application. This document was chosen becuase it represents a more recent example of a bulk sequence application than the Paradigm or Mendel applications.  Interestingly, there are a few differences between such applications in the past 6-or-so years.  Differences between the Cere's and Paradigm applications include the change from EST sequence claims (whch are effectively partial coding sequences only), towards complete cDNA sequences. One significant difference between Ceres and Paradigm's applications is the fact that Ceres has analysed missexpression of the claimed peptides in plants and made claims based on sequences that are more likely to have utility.

Mendel, in the patent application describes previously, also investigated the function of claimed transcription factors by over expression or knock-down experiments.  However, unlike Mendel, the Ceres' bulk sequence application appears to be at a much earlier stage of the patenting process.  The patent family surrounding the Mendel application is far more extensive and complex than that of the present Ceres application.  It is not unreasonable to suspect that Ceres patenting strategy for Arabidopsis has been guided by knowledge and experience of previous bulk-sequence claims, and perhaps by the USPTO's requirement for significant utility in sequence applications.

Some interesting points, underlined by the analysis of the promoter patent applications, were:

Considering the points above, it is a difficult and time-consuming task to actually identify the present assignee of a particular patent.  This is particularly true if there is little benefit for the assignee in disclosing this information in a  clear and public fashion.  It is entirely possible that many Ceres patent documents may not be identified from the analysis conducted here, which used a combination of both a standard patent document search (using "Ceres" in FULLTEXT) and a USPTO assignments database search (using "CERES" in ASSIGNEE).

Dow Chemical Company.

Dow Chemical Company is a multinational, US-based, corporation with headquarters in Michigan.  Followed by DuPont, it is the largest chemical company in the world (based on market capitalisation). It has interests not only in chemicals, but also in  agricultural products, plastics and specialised products and services.

Its wholly-owned subsidiary, Dow AgroSciences LLC, is based in Indianapolis, Indiana, USA. Dow AgroSciences has interests in crop protection and crop biotechnology.  Both Dow and Dow AgroSciences are applicants on a number of applications involving Arabidopsis.  Two of these applications that appear to deal with bulk sequence claims are discussed below.

Search Strategy

Since the major players in patenting of Arabidopsis sequences are known, the company names could be used in searches.  In this case "Dow" was used as part of a search strategy. Unfortunately using arabidopsis AND (Dow in applicant) AND ((SEQ AND ID) in claims) did not uncover any patent documents of interest for bulk sequnence claims.  Hence the general search strategy was broadened to the more general search below:

The Dow Chemicals Co. patents discussed  in this section were identified using the following search strategy.

Search details

Date of search

19 07 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

arabidopsis AND dow AND (seq AND ID)

Results

372

Comments

These results, when sorted according to revelvance score, uncovered some interesting applications in the top 100 documents. In the top 100 documents we discovered 8 of interest. These included US and PCT applications, with titles beginning " NUCLEIC ACID COMPOSITIONS CONFERRING..." By further researching these applications the documents dicussed below were uncovered.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

Patent applications for Arabidopsis bulk sequences made by Dow Chemical Co.

The following are patent applications by Dow Chemical Co. These applications cover a large number of claimed sequences (7,554 SEQ ID NOs in the first below). However, these applications contain sequences from a number of different organisms. Thus, not all the sequences are specifically from Arabidopsis. Claims are targeted towards sequences that result in altered metabolic, visual, and disease tolerance characteristics (i.e. those that have possible utility). There are 177 Arabidopsis sequences claimed in the first application below, and the other plant sequences claimed (e.g. from Nicotiana benthamiana) may overlap with other dicot species (including some from Arabidopsis) - depending on the scope of claim language used.

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2005/91708 A1

  • Earliest priority - 31 Aug 2001
  • Filed - 30 Aug 2002
  • Granted - Pending
  • Expected expiry - N/A

Title - Nucleic acid compositions conferring altered metabolic characteristics

Claim 1

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-7554 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency, wherein expression of said isolated nucleic acid in a plant results in an altered metabolic characteristic.

Claim 26

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-7554 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency for use in producing plants with an altered metabolic characteristic.

Claim 28

A method for identifying altered metabolic characteristics in a biological system comprising isolation of metabolites, comparing for altered metabolic characteristics relative to a control or reference using a bioinformatics system and suitable analytical methodology.

The claims are generally drawn to:

  • An isolated nucleic acid selected from the group consisting of
    • SEQ ID NOs: 1-7554 and
    • nucleic acid sequences that hybridize to any thereof under conditions of low stringency,  
      • wherein expression of said isolated nucleic acid in a plant results in an altered metabolic characteristic.

Conditions of low stringency may allow many of these sequences to hybridise to a range of sequences from dicots, monocots, and fungi.  Thus these are very broad claims, and unlikely to be granted in their present state. The properties and binding characteristics of the "Artificial" sequences would require further analysis to determine whether they overlap with Arabidopsis sequences...

  • An isolated nucleic acid selected from the group consisting of
    • SEQ ID NOs: 1-7554 and
    • nucleic acid sequences that hybridize to any thereof under conditions of low stringency
  • for use in producing plants with an altered metabolic characteristic.

As above...

  • A method for identifying altered metabolic characteristics in a biological system comprising
    • isolation of metabolites,
    • comparing for altered metabolic characteristics relative to a control or reference using a bioinformatics system and
    • suitable analytical methodology.

This method claim is not well defined, and is very broad in scope.  It is unlikely that the specifications and descriptions in the patent itself are enough to enable all aspects of these claims. Which metabolites? What isolation method? What metabolic characteristics? What bioinformatics system? What is a "suitable analytic methodology"?

Original Applicant:

THE DOW CHEMICAL COMPANY,
P. O. BOX 1967,  MIDLAND, MI, 48641-1967, US

Remarks

The following organism sequences are present in the application:
"Artificial" (5,807), Trichoderma harzianum (1,095), Nicotiana benthamiana (261), Arabidopsis thaliana (177), Poppy (83), Oryza sativa (78), and Saccharomyces cereviseae (59). The total number of sequences in the file header was 7,560, which does not match either the totals here, or the number of claimed sequences (implying that there may be typographical errors in the original sequence file, or as-yet unidentified organism identifiers in the datafiles).

WO 2003/020936 A1

There is a corresponding PCT application that deals with the same sequences. However, no other applications could be found at this time in other jurisdictions.

Dow Agrosciences has a similar application for a group of 2,000 sequences:

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2004/249146 A1

  • Earliest priority - 31 Aug 2001
  • Filed - 26 Jul 2004
  • Granted - Pending
  • Expected expiry - N/A

Title - Nucleic acid compositions conferring altered visual phenotypes

Claim 1

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-2065 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency, wherein expression of said isolated nucleic acid in a plant results in an altered visual phenotype.

Claim 12

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-2065 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency for use in producing a plant with an altered visual phenotype.

The claims are generally drawn to:

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-2065 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency, wherein expression of said isolated nucleic acid in a plant results in an altered visual phenotype.

An isolated nucleic acid selected from the group consisting of SEQ ID NOs: 1-2065 and nucleic acid sequences that hybridize to any thereof under conditions of low stringency for use in producing a plant with an altered visual phenotype.

Original Applicant:
DOW AGROSCIENCES LLC , 9330 ZIONSVILLE RD, INDIANAPOLIS, IN, 46268,

Remarks

Many of the sequences in this application have no clear indication of which organism they are from.  Many only have numerical identifiers, and it is difficult to tell which are from Arabidopsis.  Additionally, we could not find a sequence file deposited with the USPTO (one must exist however), and so we could not easily identify Arabidopsis sequences using this method. We chose several OCRed sequences from the sequence list and discovered that they were not present in GenBank at NCBI. 

WO 0003/00020741 A1

There is a corresponding PCT application that deals with the same sequences. However, no other applications could be found at this time in other jurisdictions.

Summary

Dow and its subsidiary DowAgroscience have interests in biotechnology R&D focused towards:

The bulk sequence documents analysed above claim a mixture of sequences from a number  of different plant species, both monocots and dicots. For Dow, sequences from Arabidopsis appear to be less important than claims on sequences from organisms that have more direct economic value such as maize, cotton, sorghum, alfalfa and soybeans, and some species that are both model systems and crops, such as tobacco or rice.  Additionally the Dow applications studied also contain a large number of claims to fungal sequences.  For example, Trichoderma harzianum (sequences claimed in the patent document discussed above) is a fungus that is used as a biocontrol measure to protect crop plants against fungal pathogens such as Fusarium sp. (a fungal pathogen of cotton, amongst others).

Dow and DowAgroscience's diverse interests are reflected in their bulk sequence applications.  So although these applications contain some Arabidopsis sequences, Arabidopsis is less a focus of patenting activity for Dow than Nicotiana benthamiana, another popular model plant system. The interest in N. benthamiana may be due to a number of reasons, including:

Applications claiming sequences from dicot crops

Monsanto Company has patent applications claiming bulk sequences from dicot crops such as soybean and cotton. We include these patent applications here because many of the claimed sequences from these crops are orthologous to sequences from A. thaliana.

US 2007/83945

This patent application covers soybean sequences . It remained unpublished for 83 months instead of the normal 18 months.

Title, Independent Claims and Summary

Assignee and licensing information

US 2007/83945 A1

  • Earliest priority - 10 Mar 2000
  • Filed - 2 Aug 2006
  • Publication - 12 Apr 07

PAIR Link

Title - Nucleic acid molecules and other molecules associated with plants

Claim 1

A substantially purified nucleic acid molecule, said nucleic acid molecule capable of specifically hybridizing to a second nucleic acid molecule having a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 through SEQ ID NO: 304905 or complement or fragment thereof.

Claim 8

A substantially purified nucleic acid molecule comprising a nucleic acid molecule or fragment thereof having a pair of defined ends, wherein said pair of defined ends are selected from the defined ends in Table A.

Claim 11

A substantially purified protein or fragment thereof encoded by a first nucleic acid molecule which specifically hybridizes to a second nucleic acid molecule, said second nucleic acid molecule having a nucleic acid sequence selected from the group consisting of SEQ ID NO:1 through SEQ ID NO:304905 or complements thereof.

Claim 12

A transformed plant having a nucleic acid molecule which comprises:

    (A) an exogenous promoter region which functions in a plant cell to cause the production of a mRNA molecule; which is linked to

    (B) a structural nucleic acid molecule, wherein said structural nucleic acid molecule is selected from the group consisting of SEQ ID NO:1 through SEQ ID NO:304905 or complements thereof or fragment of either; which is linked to

    (C) a 3′ non-translated sequence that functions in a plant cell to cause termination of transcription and addition of polyadenylated ribonucleotides to a 3′ end of said mRNA molecule.

Monsanto Co.

Remarks

We have no information that a corresponding patent application has been filed in any other country besides the USA.

US 2007/61916

This patent application covers cotton (Gossypium hirsutum) sequences.

 Publication No.

Title, Independent Claims and Summary

Assignee and licensing information

US 2007/61916 A1

  • Earliest priority - 7 May 2001
  • Filed - 14 Jul 2006
  • Publication - 15 Mar 07

PAIR Link

Title -Nucleic acid molecules and other molecules associated with plants and uses thereof for plant improvement

Claim 1

A recombinant polynucleotide selected from the group consisting of SEQ ID NO: 1 through SEQ ID NO: 58,798.

Claim 2

A recombinant polypeptide selected from the group consisting of SEQ ID NO: 58,799 through SEQ ID NO: 117,596.

Claim 3

A method of producing a plant having an improved property, wherein said method comprises transforming a plant with a recombinant construct comprising a promoter region functional in a plant cell operably joined to a polynucleotide comprising a coding sequence for a polypeptide associated with said property, and growing said transformed plant, wherein said polypeptide is selected from the group consisting of:

    a) a polypeptide useful for improving plant cold tolerance, wherein said polypeptide comprises a sequence identified as such in Table 1;

    b) a polypeptide useful for manipulating growth rate in plant cells by modification of the cell cycle pathway, wherein said polypeptide comprises a sequence identified as such in Table 1;

    c) a polypeptide useful for improving plant drought tolerance, wherein said polypeptide comprises a sequence identified as such in Table 1;

    d) a polypeptide useful for providing increased resistance to plant disease, wherein said polypeptide comprises a sequence identified as such in Table 1;

    e) a polypeptide useful for galactomannan production, wherein said polynucleotide comprises a sequence identified as such in Table 1;

    f) a polypeptide useful for production of plant growth regulators, wherein said polypeptide comprises a sequence identified as such in Table 1;

    g) a polypeptide useful for improving plant heat tolerance, wherein said polypeptide comprises a sequence identified as such in Table 1;

    h) a polypeptide useful for improving plant tolerance to herbicides, wherein said polypeptide comprises a sequence identified as such in Table 1;

    i) a polypeptide useful for increasing the rate of homologous recombination in plants, wherein said polypeptide comprises a sequence identified as such in Table 1;

    j) a polypeptide useful for lignin production, wherein said polypeptide comprises a sequence identified as such in Table 1;

    k) a polypeptide useful for improving plant tolerance to extreme osmotic conditions, wherein said polypeptide comprises a sequence identified as such in Table 1;

    l) a polypeptide useful for improving plant tolerance to pathogens or pests, wherein said polypeptide comprises a sequence identified as such in Table 1;

    m) a polypeptide useful for yield improvement by modification of photosynthesis, wherein said polynucleotide comprises a sequence identified as such in Table 1;

    n) a polypeptide useful for modifying seed oil yield and/or content, wherein said polypeptide comprises a sequence identified as such in Table 1;

    o) a polypeptide useful for modifying seed protein yield and/or content, wherein said polypeptide comprises a sequence identified as such in Table 1;

    p) a polypeptide encoding a plant transcription factor, wherein said polypeptide comprises a sequence identified as such in Table 1;

    q) a polypeptide useful for yield improvement by modification of carbohydrate use and/or uptake, wherein said polypeptide comprises a sequence identified as such in Table 1;

    r) a polypeptide useful for yield improvement by modification of nitrogen use and/or uptake, wherein said polypeptide comprises a sequence identified as such in Table 1;

    s) a polypeptide useful for yield improvement by modification of phosphorus use and/or uptake, wherein said polypeptide comprises a sequence identified as such in Table 1; and

    t) a polypeptide useful for yield improvement by providing improved plant growth and development under at least one stress condition, wherein said polypeptide comprises a sequence identified as such in Table 1.

Monsanto Co.

Remarks

We have no information that a corresponding patent application has been filed in any other country besides the USA.

US 2007/236419

This Monsanto patent application claims sequences from soybean, maize and rice, and it is related to a family of patent applications US 2004/31072, US 2004/34888, US 2004/214272, US 2004/216190 and US 2007/11783.

.

Publication No.

Title, Independent Claims and Summary

Assignee and licensing information

US 2006/236419 A1

  • Earliest priority - 6 May 1999
  • Filed - 2 Jul 2003
  • Publication - 19 Oct 06

PAIR Link

Title - Nucleic acid molecules and other molecules associated with plants and uses thereof for plant improvement

Claim 1

A recombinant DNA construct comprising a polynucleotide selected from the group consisting of a polynucleotide comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1-3549.

Claim 2

A recombinant DNA construct comprising a polynucleotide selected from the group consisting of a polynucleotide encoding a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 3550-7098.

Claim 3

A method of producing a plant having an improved property, wherein said method comprises transforming a plant with a recombinant construct comprising a promoter region functional in a plant cell operably joined to a polynucleotide comprising coding sequence for a polypeptide associated with said property, and growing said transformed plant, wherein said polypeptide is selected from the group consisting of:

    a) a polypeptide useful for improving plant cold tolerance, wherein said polypeptide comprises a sequence identified as such in Table 1;

    b) a polypeptide useful for manipulating growth rate in plant cells by modification of the cell cycle pathway, wherein said polypeptide comprises a sequence identified as such in Table 1;

    c) a polypeptide useful for improving plant drought tolerance, wherein said polypeptide comprises a sequence identified as such in Table 1;

    d) a polypeptide useful for providing increased resistance to plant disease, wherein said polypeptide comprises a sequence identified as such in Table 1;

    e) a polypeptide useful for galactomannan production, wherein said polynucleotide comprises a sequence identified as such in Table 1;

    f) a polypeptide useful for production of plant growth regulators, wherein said polypeptide comprises a sequence identified as such in Table 1;

    g) a polypeptide useful for improving plant heat tolerance, wherein said polypeptide comprises a sequence identified as such in Table 1;

    h) a polypeptide useful for improving plant tolerance to herbicides, wherein said polypeptide comprises a sequence identified as such in Table 1;

    i) a polypeptide useful for increasing the rate of homologous recombination in plants, wherein said polypeptide comprises a sequence identified as such in Table 1;

    j) a polypeptide useful for lignin production, wherein said polypeptide comprises a sequence identified as such in Table 1;

    k) a polypeptide useful for improving plant tolerance to extreme osmotic conditions, wherein said polypeptide comprises a sequence identified as such in Table 1;

    l) a polypeptide useful for improving plant tolerance to pathogens or pests, wherein said polypeptide comprises a sequence identified as such in Table 1;

    m) a polypeptide useful for yield improvement by modification of photosynthesis, wherein said polynucleotide comprises a sequence identified as such in Table 1;

    n) a polypeptide useful for modifying seed oil yield and/or content, wherein said polypeptide comprises a sequence identified as such in Table 1;

    o) a polypeptide useful for modifying seed protein yield and/or content, wherein said polypeptide comprises a sequence identified as such in Table 1;

    p) a polypeptide encoding a plant transcription factor, wherein said polypeptide comprises a sequence identified as such in Table 1;

    q) a polypeptide useful for yield improvement by modification of carbohydrate use and/or uptake, wherein said polypeptide comprises a sequence identified as such in Table 1;

    r) a polypeptide useful for yield improvement by modification of nitrogen use and/or uptake, wherein said polypeptide comprises a sequence identified as such in Table 1;

    s) a polypeptide useful for yield improvement by modification of phosphorus use and/or uptake, wherein said polypeptide comprises a sequence identified as such in Table 1; and

    t) a polypeptide useful for yield improvement by providing improved plant growth and development under at least one stress condition, wherein said polypeptide comprises a sequence identified as such in Table 1.

Monsanto Co.

Remarks

We have no information that a corresponding patent application has been filed in any other country besides the USA.

Patent application by CropDesign

Patent/ Publication No.

Title, Independent Claims and Summary

Assignee and licensing information

US 2006/21088 A1

  • Earliest priority - 18 Oct 2002 (EP)
  • Filed - 20 Oct 2003
  • Granted - pending
  • Expected expiry - N/A

Title - Identification of novel e2f target genes and use thereof

Claim 1

A method to alter one or more plant characteristics, said method comprising modifying, in a plant, expression of one or more nucleic acids and/or modifying level and/or activity of one or more proteins, which nucleic acids and/or proteins are essentially similar to any one of SEQ ID NO 1 to 2755, and wherein said one or more plant characteristics are altered relative to corresponding wild type plants.

Claim 5

A recombinant nucleic acid comprising:

(a) one or more nucleic acid sequences essentially similar to any one of SEQ ID NO 1 to 2755 or the complement strand thereof; optionally operably linked to (b) a regulatory sequence, and optionally operably linked to (c) a transcription termination sequence

Claim 12

A transgenic plant having one or more altered characteristics when compared to the corresponding wild-type plant, characterized in that said plant has modified expression of one or more nucleic acids and/or modified level and/or activity of one or more proteins, said nucleic acid and/or protein being essentially similar to any one of SEQ ID NO 1 to 2755.

Claim 14

A transgenic plant comprising an isolated nucleic acid and/or protein sequence essentially similar to any one of SEQ ID NO 1 to 2755.

Claim 16

A host cell having one or more altered characteristics when compared to the corresponding wild-type host cell, characterized in that said host cell has modified expression of one or more nucleic acids and/or modified level and/or activity of one or more proteins, said nucleic acid and/or protein being essentially similar to any one of SEQ ID NO 1 to 2755.

Claim 17

Use of a nucleic acid sequence or protein essentially similar to any one of SEQ ID NO 1 to 2755, for altering one or more plant characteristics.

CropDesign N.V.

Remarks

The related applications were also filed in Europe (EP 1551983 A2) and Australia (AU 2003298095).

Granted patents on genes from Arabidopsis

One of the characteristics of bulk sequence claims in plants is that they are almost always in the form of patent applications.  The applicants strategy in claiming bulk sequences is often to establish an early priority date and to later obtain granted patents on individual genes or small groups of genes derived from the initial applications.  Hence , to this point, we have considered only patent applicatons.  Whilst these applications have the potential to influence those working in dicot biotechnology, they are not granted patents.

Transcription Factors

In this section we will look more closely at examples of patents granted on cliams to Arabidopsis Transcription Factors (TFs). 

This group was chosen because it includes patents granted to Mendel Biotechnology Inc., who have made application claims for bulk sequences from Arabidopsis. Since the applications are to transcription factors from Arabidopsis, they may also cover homologous sequences from other dicots.  Thus analysis of granted patents for TFs offers the opportunity to analyse an important group of genes and also the ultimate patent claims that may evolve from bulk sequence claims.

Search Strategy

The patents for Arabidopsis Transcription Factors discussed in this section were identified using the following search strategy.

Search details

Date of search

30 06 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) AND (transcription AND factor)

Results

2,509

Comments

Of the 2,509 results identified using these search terms, only the top 100 results (based on relevance score) were considered. The content of their titles, abstracts, and claims were used to identify granted patents on Arabidopsis Transcription Factors.  This procedure was in no way exhaustive, and because of the large number of transcription factors in Arabidopsis and their possible value in modifying plant phenotypes it is very likely that many TF patents were not considered in this analysis. The TF patents discussed below are representative of this group of genes from Arabidopsis.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

 A new search was performed to attempt to limit the number of useful identified patent documents:

Search details

Date of search

30 06 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) AND ((transcription AND factor) in claims) 

Results

92

Comments

Of the 92 results identified using these search terms, only 7 were granted patents of interest. The remainder were applications and/or were not for Arabidopsis genes.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

 A new search was performed designed to identify granted patents from Mendel only:

Search details

Date of search

30 06 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

(Arabidopsis AND thaliana) AND ((SEQ AND ID) in claims) AND (transcription AND factor) AND (mendel AND biotechnology)

Results

40

Comments

Of the 40 results identified using these search terms, only 4 were granted patents of interest.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

Patents claiming other sequences from Arabidopsis thaliana

The following are some other examples of granted patents for Arabidopsis sequences. Note that there are many more granted patents for Arabidopsis,  far more than we can analyse at present.  We invite interested readers to suggest patented genes and sequences (or a particular area of interest) for further analysis.

The following is an example of a granted patent for an Arabidopsis transcription factor from Korea Kumho Petrochemical Co. This was chosen as a comparison to those granted transcription factor patents from Mendel Biotechnology:

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 5981729

  • Earliest priority - 27 Aug 1998
  • Filed - 27 Aug 1998
  • Granted - 9 Nov 1999
  • Expected expiry - 27 Aug 2018

Title - Transcription factor gene induced by water deficit and abscisic acid isolated from Arabidopsis thaliana

Claim 1

An isolated nucleic acid molecule encoding the transcription factor Athb-12.

The claims are generally drawn to:

  • An isolated nucleic acid molecule encoding the transcription factor Athb-12.

The scope of this claim is limited to the nucleic acid for Athb-12.  The scope of claims are not defined according to similarity or hybridisation language.  No claims are specifically made towards organisms transformed with this sequence or to methods to produce them.  It might be argued that the the doctrine of equivalents allows a wider scope to similar sequences. 
A possible work around here is to use the same transcription factor from a different organism (as long as it too is not too closely related to Athb-12 above). 

Original Applicant:


Korea Kumho Petrochemical Co. Ltd., Seoul, Korea.

Remarks

Note that this granted patent is an earlier patent than those of Mendel's.  It is uncertain whether the limited scope of the claims reflects prior existing art (unlikely for such an early application?) or merely reflects early attempts to patent gene sequences.
This patent is not a member of a large INPADOC family, and arose from a single US application made by a Korean petrochemical company.

Granted Claims

A comparison of the granted claims from the 466' patent and the claims of the parent application demonstrates one important difference between applications and granted claims
The following is claim #1 from the provisional patent application US 60/125,814:

60125814 

Note that the term "...substantially the same DNA binding specificity,..." is difficult to quantify and that the claims are towards sequences from conserved DNA binding domains of transcription factors. Thus these claims are potentially very broad.
Compare the text above with the equivalent claim from the granted US patent US6664446 that evolved from the 814' application:

US664446 B2

Whereas, the application claims refers to a broad group of conserved DNA-binding domains from transcription factors, the granted patent refers specifically to polynucleotides that encode a single protein - SEQ ID NO.18. This protein is a full-length 268-amino-acid transcription factor from Arabidopsis. Hence this claim is very-much limited in scope compared to the application. 

Even-so, it seems likely that this granted claim covers more than a single polnucleotide sequence.

This is due to the following:

Notice also that the 446' patent was filed in 2000, and granted in 2003 and claims benefit of the U.S. Provisional Application Serial No. 60/125,814 filed Mar. 23, 1999. There are at least 17-other patent applications that claim priority  from the same provisional application.  It is possible that other granted patents will arise from this family in the near future.

Summary

An important point demonstrated in this chapter, by comparison with previous chapters, is the difference between bulk sequence applications and the patents that are likely to evolve from them. Granted patents that provide exclusionary rights based on sequences from Arabidopsis are often very-much restricted in claims scope than their parent applications. So, while the claims made in bulk sequence applications by companies such as Paradigm and Mendel may be broad, descendent patents appear to be reasonably limited in scope.

In one sense this is an entirely obvious, and not unreasonable observation.  Patent applicants often attempt in the first instance to claim the maximum range possible, and these claims end up being moderated and limited by rounds of revisions required by patent examiners. This is a common feature of patent applications and is not limited to applications containing sequences.  In fact such a strategy is almost enshrined, if not promoted, by many patenting systems.

As stated in a previous chapter, granted patents are not necessarily the problem (although some may argue with this!).  Instead, it is the uncertainty generated by the ability of large companies to make such bulk sequence applications that may cause problems for those working with Arabidopsis (and flowering plants). The process of setting early priority dates with bulk applications and then keeping these applications pending through chains of continuations, continuations-in-part, and divisional applications, results in the following:

Such uncertainty may not be at the forefront of much not-for-profit research, it will however be pivotal in decisions made by private R&D companies, where FTO is critical.  With the growing need for collaboration between not-for-profits, Universities, and private industry, such uncertainty is a growing concern for university researchers and not-for-profits.

This uncertainty has the potential to prevent innovation - a result which would be in conflict with the intent of the patent system.

Bulk Sequence Claims from other Dicots

As mentioned in previous chapters, one of the problems with bulk sequence claims (not just for researchers but also for patent examiners) is the fact that these applications often try to claim homologous sequences in many other organisms.  This is done typically by the use of such claim language as:

NbClaims

(Note that this is a claim from the Dow Chemical Co. application we discussed in Chapter 6, which contains many Nicotiana benthamiana and "Artificial" sequences )

Although this is not the only way of claiming many related sequences.  It also has the added "advantage" (if you are the applicant!) of being difficult to easily quantify, and hence offers the opportunity of obtaining the broadest possible claim scope.  Those "skilled in the art" will realise that many (possibly only distantly related!)  nucleic acid sequences will hybridise together under low stringency conditions. Further, the term ".., wherein expression of said isolated nucleic acid in a plant results in an altered metabolic characteristic." provides a way of claiming such nucleic acid sequences with utility.

The probability that an applicant will achieve such claims to all 7554 sequences is small.  However, it is likely (and no doubt the applicant expects) that some of these sequences will be claimed in successful future applications. The uncertainty with this claim is two-fold:

At this point we see the connection between such broad bulk-sequence claims in other dicoots (Nicotiana benthamiana claim above) and Arabidopsis.  It is very likely that the bulk of the genes/DNA sequences in claim 1 above will hybridise under low stringency conditions with the corresponding Arabidopsis DNA sequences.  Hence bulk sequence claims for other dicots influence Arabidopsis in the same way that the bulk sequence claims of Paradigm Genetics Inc. would read on sequences from Nicotiana benthamiana!

Bulk claims for Arabidopsis thaliana and Nicotiana benthamiana sequences, may not only interfere with each other,
they may effectively claim sequences from important crop plants such as soya, canola, and potato.

Search Strategy

The search strategy we used in this case was different from those used previously, as it made use of the term "seqdata".  Sequence listings of considerable size (greater than 300 pages, i.e. those likely to represent larger bulk-sequence applications) are now available for separate download from the USPTOs website (PSIPS).  Most recent, but perhaps not all, applications and patents with large sequence listings can thus be identified with the serach term "seqdata". A term that forms part of the referring URL found at the end of such patent documents.  Hence this term offers a way of limiting searches to recent bulk-sequence documents.  Another option available is to go to the USPTO's PSIP site and search the listings via publication date.  This latter option is difficult since there are >1000 patent documents to browse, which contain large sequence listings.   

One of the patents discussed in this chapter were identified using the following search strategy.

Search details

Date of search

21 07 2006

Database searched

Patent Lens, accessible at www.patentlens.net

Type of search

Structured Search

Collections searched

US-A, WO-A, US-B, EP-B, AU-B (add link to "Documents in  Collection" which will take you to a web page that describes the extent of the collection.)

Search terms

seqdata AND plant 

Results

328

Comments

From the 328 results identified using these search terms, we chose several that delt with sequences from dicots.  These were used for comparisons with Arabidopsis patent documents and sequences.

You may find other patents or applications of interest based on your particular circumstances.  We encourage you to go to the Patent Lens to adapt this search to your needs or to update this search.  We invite you to provide comments on patents or applications that you see below or that you identify in your own searches which may be of interest to those reading this landscape, by going to the Add Comment link.

Patent applications for bulk sequences from other dicots

The following patent documents demonstrate attempts to patent large groups of sequences from other dicots.  These applications use claim language such that a successful claim would almost certainly cover large numbers of sequences from other dicots, and perhaps also some from Arabidopsis.  It is therefore useful to investigate briefly the claim contents of such applications - particularly since many of these applications are more recent than those of Paradigm Genetics, and, unlike the applications of Paradigm, appear to be in the process of active patenting efforts by the respective companies.

The claims of the Agrigenesis patent below refer to sequences derived from Eucalyptus grandis and Pinus radiata:

Patent or Publication No.

Title, Independent Claims and Summary

Applicant

US 2005/50583 A1

  • Earliest priority - 11 Jan 2000
  • Filed - 09 Jun 2004
  • Granted - Pending
  • Expected expiry - N/A

Title - Compositions isolated from plant cells and their use in the modification of plant cell signaling

Claim 1

An isolated polynucleotide comprising

  • a sequence selected from the group consisting of: sequences provided in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974.
Claim 2

An isolated polynucleotide comprising

  • a sequence selected from the group consisting of:
    • (a) complements of the sequence recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (b) reverse complements of the sequence recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (c) reverse sequences of the sequences recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (d) sequences that are 100-mers of a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (e) sequences that are 40-mers of a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; and
    • (f) sequences that are 20-mers of a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974.
Claim 3

An isolated polynucleotide comprising

  • a sequence selected from the group consisting of:
    • (a) sequences having at least 75% identity to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (b) sequences having at least 90% identity to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (c) sequences having at least 95% identity to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974;
    • (d) sequences having at least 98% identity to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; and
    • (e) sequences that hybridize to a sequence of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974 under stringent hybridization conditions, wherein the polynucleotide encodes a polypeptide having substantially the same functional activity as a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974.
Claim 5

An isolated polypeptide comprising

  • an amino acid sequence selected from the group consisting of: SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989.
Claim 6

An isolated polypeptide comprising

  • an amino acid sequence selected from the group consisting of:
    • (a) sequences having at least 75% identity to a sequence of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989;
    • (b) sequences having at least 90% identity to a sequence of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989; and
    • (c) sequences having at least 95% identity to a sequence of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989, wherein the polypeptide has substantially the same functional properties as a polypeptide of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989.

The claims are generally drawn to:

  • Polynucleotides cited. (Claim 1)
  • Polynucleotides consisting of: (Claim 2)
    • (a) complements of the sequence recited
    • (b) reverse complements of the sequence recited
    • (c) reverse sequences of the sequences recited
    • (d) sequences that are 100-mers cited sequences
    • (e) sequences that are 40-mers of cited sequences
    • (f) sequences that are 20-mers of cited sequences
  • Polynucleotide consisting of: (Claim 3)
    • (a) sequences having at least 75% identity to cited sequences
    • (b) sequences having at least 90% identity to cited sequences
    • (c) sequences having at least 95% identity to cited sequences
    • (d) sequences having at least 98% identity to cited sequences
    • (e) sequences that hybridize to cited sequences under stringent conditions, and the encoded polypeptide has a similar functional activity to the polypeptide encoded by the cited sequences.
  • Polypeptides consisting of the cited sequences.(Claim 5)
  • Polypeptides consisting of: (Claim 6)
    • (a) sequences having at least 75% identity to a cited sequences
    • (b) sequences having at least 90% identity to cited sequences and
    • (c) sequences having at least 95% identity to cited sequences, where the polypeptide has similar function to the cited polypeptide sequences.

The sequences above were derived from:

  1. Eucalyptus grandis (SEQ ID NO: 2, 8, 9, 11, 15, 18, 19, 21-25, 33, 34, 38, 42, 48-52, 55-58, 67, 131-301, 448-471, 474-478, 848, 850-874, 882-887, 946-952, 960-967 and 971-974)
  2. Pinus radiata (SEQ ID NO: 1, 3-7, 10, 12-14, 16, 17, 20, 26-32, 35-37, 39-41, 43-47, 53, 54, 59-66, 302-447, 472, 473, 479-481, 833-847, 849, 875-881, 888, and 968-970)

Original Applicant:
AGRIGENESIS BIOSCIENCES LIMITED,
Parnell, Auckland, NZ.

Summary

[Summary for overlapping claims from one organism to another.  Leading into and connection with genome mapping of patents in Chapter 10]

The Extent of Sequence Patenting in Arabidopsis

In Chapter 8 we expanded upon the idea that claims originating on sequences from one organism, for example Arabidopsis, may also claim similar sequences from other organisms. In particular claims to sequences from a model organism (that may have little economic importance) may include hybridisation and/or similarity language that may effectively claim sequences from economically important crops. So companies that seek to claim large numbers of genes from Arabidopsis are effectively also attempting to claim similar sequences from important crop plants.

The degree to which such claims may read on other organisms is very difficult to calculate due to:

For each of the points above, we have endevoured to develop a solution, an improvement, or an approximation, that gives a better idea of the state of inter-genome claims.

US Applications that Claim Arabidopsis thaliana Sequences

The plot below shows the total number of Arabidopsis thaliana patent sequence filings (blue bars) as well as the number of applications claiming sequences that match a segment of the the Arabidopsis thaliana genome at least 150bp long (green bars) from 1994 to 2006. Because published US application data was not available prior to  2001, the low number of claimed sequences and sequence filings from 1994 to 2001 may be largely attributed to a lack of data from the USPTO rather than a lack of filings.

This plot shows the average number of sequences referenced in claims since 2001.

US Patent Applications on Arabidopsis thaliana

Granted US Patents on Arabidopsis thaliana

US Arabidopsis thaliana Patent Applications: Number of Sequences Referenced in Claims per Genome Segment

For the plots below, the Arabidopsis genome was divided into 300 kbp segments.  For patent applications that claim Arabidopsis gene sequences, the plots below show the number of sequences that match at least a 150 bp fragment of each 300 kbp genome segment.

In addition to initial patent filings, continuation and divisional applications are included in the count, so the same sequence may be counted multiple times. Each plot shows data for a single Arabidopsis chromosome.

See the corresponding plots for the rice genome landscape.

The Arabidopsis genomic data contains some vector contamination for chomosomes 2 and 3 resulting in some spikes that go off the scale of the chart.

US Patent Applications on Arabidopsis thaliana

Granted US Patents on Arabidopsis thaliana

US Arabidopsis thaliana Patent Applications: Number of Patent Applications that Claim Arabidopsis Sequences per Genome Segment

For the plots below, the Arabidopsis genome was divided into 300 kbp segments. The plots below show the number of US patent applications that claim sequences that have at least a 150 bp match to a 300 kbp genome segment.

In addition to initial patent filings, continuation and divisional applications are included in the count, so the same sequence may be counted multiple times. Each plot shows data for a single Arabidopsis chromosome.

See the corresponding plots for the rice genome landscape.

US Patent Applications on Arabidopsis thaliana

Granted US Patents on Arabidopsis thaliana

US Arabidopsis thaliana Patent Applications: Percent Coverage by Patent Claims per Genome Segment

For the plots below, the Arabidopsis genome was divided into 300 kbp segments. For each 300 kbp segment, the percent coverage by claims in US patent applications was calculated. For example, if an entire 300 kbp segment was claimed by US patent applications, the plot would read 100% for that segment. The Y axes show the percent coverage, and the X-axes show the region (locus) of the Arabidopsis genome. Each plot shows data for a single Arabidopsis chromosome.

See the corresponding plots for the rice genome landscape.

US Patent Applications on Arabidopsis thaliana

Granted US Patents on Arabidopsis thaliana

Granted US Arabidopsis thaliana Patents: Number of Sequences Referenced in Claims per Genome Segment

For the plots below, the Arabidopsis genome was divided into 300 kbp segments.  For granted patents that claim Arabidopsis gene sequences, the plots below show the number of sequences that match at least a 150 bp fragment of each 300 kbp genome segment.

In addition to initial patent filings, continuation and divisional applications are included in the count, so in some cases, the same sequence is counted multiple times. Each plot shows data for a single Arabidopsis chromosome.

See the corresponding plots for the rice genome landscape.

US Patent Applications on Arabidopsis thaliana

Granted US Patents on Arabidopsis thaliana

Granted US Arabidopsis thaliana Patents: Percent Coverage by Patent Claims per Genome Segment

For the plots below, the Arabidopsis genome was divided into 300 kbp segments. For each 300 kbp segment, the percent coverage by claims in granted US patents was calculated. For example, if an entire 300 kbp segment was claimed by US patents, the plot would read 100% for that segment. The Y axes show the percent coverage, and the X-axes show the region (locus) of the Arabidopsis genome. Each plot shows data for a single Arabidopsis chromosome.

See the corresponding plots for the rice genome landscape.

US Patent Applications on Arabidopsis thaliana

Granted US Patents on Arabidopsis thaliana

Granted US Arabidopsis thaliana Patents: Number of Patent Applications that Claim Arabidopsis Sequences per Genome Segment

For the plots below, the Arabidopsis genome was divided into 300 kbp segments. The plots below show the number of granted US patents that claim sequences that have at least a 150 bp match to a 300 kbp genome segment.

In addition to initial patent filings, continuation and divisional applications are included in the count, so the same sequence may be counted multiple times. Each plot shows data for a single Arabidopsis chromosome.

See the corresponding plots for the rice genome landscape.

Genome Coverage in Applications and Patents by For-Profit Companies

A number of companies have claimed Arabidopsis genes sequences in U.S. patent applications and issued patents. Here we present graphs showing the percentage of each chromosome that is claimed in patent applications and patents assigned to seven different companies.

For the plots below, the percent coverage for each rice chromosome is shown. For example, if the entire sequence of a particular Arabidopsis chromosome were claimed in patents or applications, the plot would read 100% for that chromosome. The Y axes show the percent coverage, and the X-axes show the chromosome of the Arabidopsis genome. Note: the scales of the Y axes have been adjusted to reflect the data for each company.

BASF (includes American Cyanamid)

Applications

Granted Patents

Bayer (includes Aventis, Rhone-Poulenc, Plant Genetic Systems, Hoechst, Proagro, Agrevo, Biogentic, Planttec, Sementes, Nunza, Agrinomics, and Rorer)

Applications

Granted Patents

Dow (includes Mycogen and Dowelanco)

Applications

Applications

DuPont (includes Pioneer)

Applications

Granted Patents

HySeq (includes Nuvelo and Variagenics)

Applications

Monsanto (includes Calgene, Delta Pine, Seminis, Agracetus, Dekalb, Emergent, Produsem, Mahendra, Stoneville, and Pharmacia)

Applications

Granted Patents

Paradigm (includes Icoria)

Applications

Granted Patents

Sembiosys

Applications

Granted Patents

Syngenta (includes Torrey Mesa, Zeneca, Ciba Geigy, Novartis, Advanta, Garst, Agripro, and Danisco)

Applications

Granted Patents

Mapping Arbidopsis Patents to the Arabidopsis Genome

The first steps in understanding the extent to which claims from one organism overlap into the genome of another is to develop the databases, software, and methodology to answer the question.  The next step we chose was to perform the most straightforward analysis, both to test the methodology and to obtain a "baseline" for comparisons of future analyses.  We set out to:

These initial steps were used to attempt to reproduce aspects of the analysis performed by Jensen and Murray (Science. 2005 Oct 14;310(5746):239-40.) on the human genome map for Arabidopsis.  Essentially, claimed sequences from US-granted patents were megaBLASTed against the Arabidopsis genome sequences, and sequences of interest (longer than 28bp, with expect values of <1 x 10e-200 and highest bit scores) were identified.  The genome positions of each of these sequences of interest were identified and grouped together into regions.  Each region spans a 300kb region of an Arabidopsis chromosome. The number of patents in each region was then mapped to the region position on the respective chromosome. 

The end result of this analysis was a dataset of patents containing Arabidopsis sequences in their claims, linked to the specific map location of those sequences on the Arabidopsis genome.  The sequences and matched genomic regions are made using relatively highly specific search criteria. Resulting in "exact" matching of claimed sequences to their corresponding genomic sequence. No attempt was made to identify homologous sequences within the genome, which may also be claimed by the patent document.  Hence the maps obtained are likely to under-represent the actual sequence claims in Arabidopsis, from patents claiming Arabidopsis sequences.

Note also:  This analysis does not deal with those sequences that are present in patent applications, and most importantly will not show sequences appearing in the claims section of bulk sequence applications!

Sequences from US Patents Mapped onto the Chromosomes of Arabidopsis

Chr1

By performing a modification of the analysis of Jensen and Murray (Science. 2005 Oct 14;310(5746):239-40.), a member of our group was able to map patented sequences  from US granted patents onto the Arabidopsis chromosomes.  The following 5 maps demonstrate the distribution of patents across Arabidopsis chromosomes.

Note carefully the limitations of this analysis:

  • Only granted US patents were analysed
  • Only patent sequence details for DNA sequences were analysed not claims on protein sequences
  • Only sequences recorded in NCBI's Patent database were included (June  2006 version)
  • Only sequences appearing in the claims section were analysed
  • No regard was made of the claim language, or of claim scope (no attempt was made to identify claims to other homologous or related sequences)
  • No regard was given to claims originating from overlap of claims from other species (i.e. we identified only "identical" Arabidopsis sequences from patents).

(see below for more details of these analyses)

(Chromosome 1 = 30Mb)

Chr2

In all chromsome maps the Y-axis is the number of patents found within 300Kb sections of the chromosome map.

Chromosome 2

Chr3

Chromosome 3:

Note:

  • The Y-axis range is 0 to 10 patents
  • The locus map location 13,500,000 is a patent "hot spot" and contains 47 patents (with references to 78 individual GenBank sequences) For more details of this "hot spot" see the text below.
Chr4

Chromosome 4

Note:

Sequences contained in US applications were not included in any of these chromosome maps.

All of the bulk sequence applications discussed in the text below are not included in this map.

Chr5

Chromosome 5

The same Y-axis scale has been used throughout, to allow the user to compare the relative "intensities" of patenting activity on each of the chromosomes.

Different X-axis scales have been used, to allow optimal resolution of patent activity.  The relative sizes of the different chromosomes are represented in the chromosome graphic below the X-axis.

Note:  Because of the nature of the analysis above (see the notes on the right of the maps above), the mapped sequences are likely to greatly under-represent the true degree of patent activity on the Arabidopsis genome!

Analysis of Patenting "Hot Spots" on the Arabidopsis genome map

During the mapping of patent documents to the corresponding regions of the Arabidopsis map it was noticed that there are a number of places on the map that have a higher-than-average patenting activity.

These regions included:

Arabidopsis

Chromosome

Map Position
(Mbases)

Number of Patents

Number of Sequences
Mapped to the region

2

3.6

6

7

3

3.6

10

14

3

13.5

7

78

4

12.9

8

10

From this table further analyses revealed the following:

Arabidopsis Chromosome 3 and the 47-patent hot spot at Map position 13.5Mb:

Analysis of the sequences from these 47 patents (by BLAST at NCBI) demonstrated that a ~450bp fragment from a cloning vector contaminates the chromosome sequence at this position. The large number of patent documents matching at this site are a result of matches with this region from patent sequences unrelated to Arabidopsis sequences. Our method is sensitive to such "contamination" since it relies on almost identical matches with the chromosomal sequences of Arabidopsis to identify patented sequences (and hence patents of interest from Arabidopsis). It is interesting that this sequence contamination has persisted in the Arabidopsis genome sequence and suggests that the methods used to screen for vector contamination are not entirely reliable.

Arabidopsis Chromosome 3: The 10-patent-hot-spot at Map position 3.6Mb:

US patents appearing at this hotspot include those listed in the following table:

Patent
Number

Issue Date

Title
(use/description of sequence claimed)

Status

Applicant

5912415

06-15-1999

Arabidopsis spindly gene, methods of identification and use
(
gene involved in the gibberellin signal transduction)

Expired

Regents of the University of Minnesota

5965793

10-12-1999

Strong early seed-specific gene regulatory region
(seed-specific expression/modifying fatty acid production in seed tissue)

In force

Monsanto Company, Inc

6291742

09-18-2001

Production of hydroxylated fatty acids in genetically modified plants
(plant fatty acyl hydroxylases)

In force

Carnegie Institution of Washington, Monsanto Company Inc.

6310194

10-30-2001

Plant fatty acid hydroxylases
(plant fatty acyl hydroxylases)

In force

Carnegie Institution of Washington, Monsanto Company Inc.

6342658

01-29-2002

Fatty acid desaturases and mutant sequences thereof
(Brassica: mutation in a delta-12 or delta-15 fatty acid desaturase gene)

In force

Cargill, Inc

6372965

04-16-2002

Genes for microsomal delta-12 fatty acid desaturases and hydroxylases from plants
(alteration of plant lipid composition)

In force

E.I. du Pont de Nemours and Co

6642436

11-04-2003

DNA encoding for plant digalactosyldiacylglycerol galactosyltransferase and methods of use
(Arabidopsis: cDNA encoding digalactosyldiacylglycerol galactosyltransferase (DGD1))

In force

Board of Trustees operating Michigan State University

6872872

03-29-2005

Genes for microsomal delta-12 fatty acid desaturases and related enzymes from plants
(Arabidopsis thaliana cDNA for microsomal delta-12 desaturase)

In force

E. I. du Pont de Nemours and Co

6919466

07-19-2005

Genes for microsomal delta-12 fatty acid desaturases and related enzymes from plants
(use of fatty acid desaturase enzymes to modify plant lipid composition)

In force

E. I. du Pont de Nemours and Co

6967243

11-22-2005

Fatty acid desaturases and mutant sequences thereof
(Brassicaceae: plants having at least one mutation that controls levels of unsaturated fatty acids)

In force

Cargill, Inc

Essentially this hotspot refers to (amongst others) claims against fatty acid desaturase (FAD) genes from Arabidopsis and related organisms such as Brassica spp. Some of the sequences mapping to chromosome 3 of Arabidopsis are in fact the FAD gene from rape (Brassica napus). The importance of FAD claims is underlined by the presence of large private companies involved in agritech research and development: Monsanto, Du Pont, and Cargill. Manipulation of FAD levels in plants is one way in which the ratio of saturated to unsaturated oils might be controlled. Such manipulation is of interest due to the human health implications and industrial applications (including biodiesel) possible through the generation of crop plants with modified levels of unsaturated oils. The patents listed above are an example of a small patent thicket that has arisen around an industrially-important gene family (in this case FAD from Arabidopsis and Brassica sp.). Although beyond the scope of the present landscape: It is interesting to speculate that the FAD genes from other species (e.g. Gossypium hirsutum) may also be the focus of patenting activity.

Arabidopsis Chromosome 2 and the 6-patent hot spot at Map position 3.6Mb:

US patents appearing at this hotspot include those listed in the following table:

Patent
Number

Issue Date

Title
(use/description of sequence claimed)

Status

Applicant

6316004

11-13-2001

Chimeric somatostatin containing protein and encoding DNA, plasmids
of expression, method for preparing chimeric protein, strain-producers, immunogenic composition, method for increasing
the productivity of farm animals

Expired

Tikhonenko; T.

6372457

04-16-2002

Process and materials for production of glucosamine
(contains sequence from many expression vectors)

In force

Arkion Life Sciences LLC

6537777

03-25-2003

Human porphobilinogen deaminase sequences

In force

Hemebiotech A/S

6667150

12-23-2003

Method and phage for the identification of nucleic acid sequences encoding members of a multimeric (poly) peptide complex

In force

Morphosys AG

6933146

08-23-2005

Methods and means for producing efficient silencing construct using recombinational cloning

In force

Commonwealth Scientific and Industrial Research Corporation;

6972197

12-06-2005

Plant chromosome compositions and methods
(SEQ ID NO 4: Unknown, expressed Arabidopsis protein. centromeric sequence?)

In force

The University of Chicago

Again, most of the patenst above do not deal with direct claims to sequences from Arabidopsis or other plants. Instead there appears to be some vector contamination in the chr2 sequence from Arabidopsis, and this appears to account for much of the "hotspot" activity here.

Arabidopsis Chromosome 4 and the 8-patent hot spot at Map position 12.9Mb:

US patents appearing at this hotspot include those listed in the following table:

Patent
Number

Issue Date

Title
(use/description of sequence claimed)

Status

Applicant

5856452

01-05-1999

Oil bodies and associated proteins as affinity matrices
(Oleosin from Arabidopsis thaliana)

In force

Sembiosys Genetics Inc

5891859

04-06-1999

Method for regulating cold and dehydration regulatory genes in a plant
(Method defined by part of the CBF1 gene from Arabidopsis)

In force

Michigan State University

5892009

04-06-1999

DNA and encoded protein which regulates cold and dehydration regulated genes
(The CBF1 gene from Arabidopsis)

In force

Michigan State University

6198021

03-06-2001

GA 20-oxidase gene sequences
(Transgenic plants with GA 20-oxidase from Arabidopsis)

In force

Long Ashton Research Station

6417428

07-09-2002

Plant having altered environmental stress tolerance
(transcription factors from Arabidopsis)

In force

Michigan State University/Mendel Biotechnology Inc.

6495742

12-17-2002

Genes encoding plant transcription factors
(Transcription factor from Arabidopsis)

In force

Independent Administrative Institute Japan International
Research Center for Agricultural Sciences

6670528

12-30-2003

Environmental stress-tolerant plants
(Arabidopsis thaliana DREB1A transcription factor)

In force

Independent Administrative Institute, Japan International
Research Center for Agricultural Sciences

6706866

03-16-2004

Plant having altered environmental stress tolerance
(Transcription factor from Arabidopsis)

In force

Michigan State University

(Note that the Applicant in the above tables was that listed in the PatentLens database entry for the given patent.  Where no company name appeared (only inventors names) a further USPTO search was conducted to identify assignee information. Note: These assignments may not be accurate after the date of issue.)

US Applications - Sequence Filings by Year

The plot below shows the the total number of U.S. patent applications containing sequence listings from 2001 through the second quarter of 2005. The plot was constructed with data from applications published since 15 March 2001, when applications were first made available in the XML format. The data do not extend to the current date due to the 18 month lag between when an application is filed and when the application generally publishes.

The graph below shows the average number of sequences referenced in claims over the same time period.

Mapping Claimed Sequences to the Arabidopsis Genome

As we saw in the previous analysis of claimed sequences appearing in granted US patents (US-B patent documents), it is possible to obtain an overview of patent activity in Arabidopsis by:

  1. Generating a query database consisting of the sequences appearing in the claims section of granted US patents,
  2. Generating a target database consisting of the entire sequences of the chromsomes from Arabidopsis,
  3. Performing a megaBLAST analysis with this data (With Expect <= 1 x 10-200 to filter results),
  4. Filtering the megaBLAST results to obtain matches of >=150bp in length (and taking only the match with the highest bit score), and
  5. Marking the approximate positions of these results  onto a map of the Arabidopsis chromosomes.

From this we were able to detect patenting "hot-spots" and to identify various Arabidopsis genes of interest to those involved in exploiting related genes.

One limitation of this analysis is the fact that much of the Arabidopsis sequences appear in patent applications, not in granted patents.  To overcome this limitation a further query database was constructed, consisting of sequences appearing in the claims section of US patent applications (US-A patent documents).  Included in this dataset were sequences from bulk sequence applications for Arabidopsis.

(Note:  The construction of this database required considerable effort.  The database itself can be found here.)

We repeated the analysis performed above on this new dataset, to determine the extent of patent pending claims on the Arabidopsis genome.

 This analysis does not deal with those sequences that are claimed through similarity or hybridisation language.
Only those sequences that are essentially identical to Arabidopsis sequences were mapped.
Sequences from granted US patents were not included in these maps.

Arabidopsis Genome Maps of Sequences appearing in US-patent applications

Chr1

By performing a modification of the analysis of Jensen and Murray (Science. 2005 Oct 14;310(5746):239-40.), a member of our group was able to map sequences appearing in the claims section of US patent applications onto the Arabidopsis chromosomes.  The following 5 maps demonstrate the distribution of sequences across Arabidopsis chromosomes.

Note carefully the limitations of this analysis:

  • Only sequences from US applications were analysed
  • Only patent sequence details for DNA sequences were analysed not claims on protein sequences
  • By our estimates >90% of relevant US-A patent documents have been included in this analysis
  • Only sequences appearing in the claims section were analysed
  • No regard was made of the claim language, or of claim scope (no attempt was made to identify claims to other homologous or related sequences)
  • No regard was given to claims originating from overlap of claims from other species (i.e. we identified only "identical" Arabidopsis sequences from patents).

(see below for more details of these analyses)

(Chromosome 1 = 30Mb)

chr2

In all chromosome maps the Y-axis is the number of patent applications found within 300Kb sections of the chromosome map.

Chromosome 2
The hot-spot at 3.6MB has 1077 sequences from patent applications mapped to it (Y scale = 200 patents)

chr3

Chromosome 3:

Note:

  • The Y-axis range is 0 to 200 patents
  • The locus map location 13,500,000 is a patenting "hot spot" and 1336 sequences from applications are mapped here.
  •  For more details of this "hot spot" see the comments in the previous section.

chr4

Chromosome 4

Note:

Sequences contained in US granted patents were not included in these chromosome maps. Although it is unlikely that the inclusion of such data would change the map significantly.

As far as possible, bulk sequence applications have been included in this analysis.

chr5

Chromosome 5

Different X-, and Y-axis scales has been used throughout, to allow the best possible view of all data.

Note:  Because of the nature of the analysis above (see the notes on the left of the maps above), the mapped sequences are likely to greatly under-represent the true degree of patent activity on the Arabidopsis genome!

Methods for Genome Mapping

In this genome landscape, we have determined the extent to which the Arabidopsis genome sequences are claimed in both issued U.S. patents and U.S. patent applications. Our initial approach was to reproduce aspects of the analysis performed by Jensen and Murray (Science 310:239-40, 2005) in their analysis of the human genome. Our process entailed a number of informatics steps that are outlined below.

In summary, we compiled a database of patent sequences that are claimed in patent applications, and compared these sequences to the published Arabidopsis genome using a BLAST-type interface. We then determined which portions of the claimed sequences had significant homology to sequences in the Arabidopsis genome, and mapped these sequences to their location on the Arabidopsis chromosome. The results of our analysis are shown in the subsequent pages of this landscape.

1. Compilation of a searchable Arabidopsis genome database

We used the most recent Arabidopsis genome sequences from the Arabidopsis Genome Project at NCBI. We then used the formatdb program from NCBI to convert the data to a searchable BLAST database.

2. Compilation of sequence databases for granted patents and patent applications

Applications

For patent applications, we acquired the sequences of the bulk sequence applications from the Publication Site for Issued and Published Sequences (PSIPS) web site. This web site provides sequence listings for U.S. patents and applications that are longer than 300 pages. We also acquired the sequence listings for the non-bulk sequence listings (fewer than 300 pages in length) that are published by the USPTO as an XML document. For each of the listing types (bulk sequence and non-bulk sequence), there was a separate file for nucleotides and amino acids. Data for U.S. applications have only been available since 2001.

The bulk and non-bulk sequence listings were then converted to a common data format (FASTA) and combined to create one database for nucleotide sequences, and one database for amino acid sequences. Additionally, each of these combined databases was converted to a searchable BLAST database for use with CAMBIA's patent sequence search tool.

Granted (Issued) Patents

For granted U.S. Patents, we had a data source that wasn't available for the applications; GenBank at NCBI has a searchable patent database of sequences disclosed in granted patents. To create our granted patents sequence database, we started by acquiring the U.S. patent sequences from GenBank. This required removing all sequences that originated from non-U.S. patents.

We then acquired the sequence listings from the bulk and non-bulk patents in the manner described above in the Applications section. The data from all three sources (GenBank, bulk, and non-bulk) were converted to a common format. We then carried out a filtering step that removed any duplicate sequences in the data provided by GenBank, and the sequences provided by the USPTO (bulk and non-bulk).

The identical process was carried out for nucleotide sequences and amino acid sequences. As with the applications, each of these combined databases was converted to a searchable BLAST database for use with CAMBIA's patent sequence search tool.

3. Identification of sequences that are claimed in granted patents and patent applications

A key feature of our analysis is that we determined which sequences were actually claimed in patents and applications, rather than just disclosed in the specification. To this end, we created  four databases that contain only the sequences that are claimed in patent applications. The four databases created correspond to nucleotide sequences in applications, amino acid sequences in applications, nucleotide sequences in granted patents, and amino acid sequences in granted patents.

We compiled a list of common phrases that are used to identify sequence listings in claims. This step was tricky, as there are many different phrases that patent applicants use to designate sequence listings in claims (see examples of phrases) . Using these phrases, we created a list of sequence ID numbers that are designated in patent claims. We then created four new databases that contain only the sequences that are claimed in applications and patents.

4. BLAST search of Arabidopsis genome database using the claimed sequences as input

After compiling a collection of sequences that are claimed in patents and applications, we then used those sequences to query the Arabidopsis genome database (see step 1) using mega BLAST to identify claimed sequences that have significant homology to sequences in the Arabidopsis genome. We performed this analysis only with the claimed nucleotide sequences in patents and applications.

5. Plotting the results of the analysis

The sequences that had significant homology to sequences in the Arabidopsis genome were plotted three different ways. The criteria for matches in the database was that they have a BLAST E value less than e-200.

  1. Sequence Count. For these plots, the Arabidopsis genome was divided into 300 kbp segments.  For patent applications that claim Arabidopsis gene sequences, we  plotted the number of sequences that match at least a 150 bp fragment of each 300 kbp genome segment. For each sequence, only the highest-scoring genome match was counted.
  2. Patent Count. As with the Sequence Count plots, the Arabidopsis genome was divided into 300 kbp segments. For patent applications that claim Arabidopsis gene sequences, we plotted the number patent applications that claim a 150 base pair or longer fragment of each 300 kbp genome segment. For each sequence, only the highest-scoring genome match was counted.
  3. Percent Genome Coverage. For these plots, we plotted the percentage of each genome segment that was claimed in patent applications. Unlike the previous two plot types, there was no requirement that the fragments that matched the Arabidopsis genome sequences be a minimum length (e.g., 150 base pairs). However, the minimum size for a positive match using the BLAST interface is 26 base pairs, so matches shorter than 26 were not included in the analysis. In addition, if multiple sequences covered the same portion of the genome, that portion of the genome was counted as being covered only once.

The end result of our analysis was a dataset of patents containing Arabidopsis sequences in their claims, linked to the specific map location of those sequences on the Arabidopsis genome. The sequences and matched genomic regions are made using relatively highly specific search criteria. Resulting in "exact" matching of claimed sequences to their corresponding genomic sequence. No attempt was made to identify homologous sequences within the genome, which may also be claimed by the patent document. Hence the maps obtained are likely to under-represent the actual sequence claims in Arabidopsis, from patents claiming Arabidopsis sequences.

How Does Claim Language Affect the Genome Maps?

During the patent application process the applicant generally attempts to achieve the widest possible claims. Often this is facilitated by the use of similarity or hybridisation language such as:

[INSERT A FIGURE EXTRACTED FROM A PATENT APPLICATION - ONE CLAIMING ~60% IDENTICAL SEQUENCES]

The mapping analyses performed above only detect sequences that are ~100% identical to the claimed sequences, and miss altogether those that are less-than ~100% identical. This fact, and the existence of such claim language, implies that there are many-other sequence claims that do not appear on our maps. The maps above are therefore under-estimates of the true patenting activity against the Arabidopsis genome

Remember that the converse is also true: claims based on Arabidopsis genes, which use similarity and hybridisation language,
are likely to be claims to the same or similar genes in other organisms!

Many of the sequences mapped to the chromosomes above will have homologous sequences elsewhere in the genome.  Similarly, sequences from other related-organisms (e.g. Brassica napus, canola) that are <100% identical to the corresponding Arabidopsis gene are probably not present on the map (although some that are almost 100% identical are there already).

The use of hybridisation language also contributes to these cross-genome claims. For example, the following claim would almost certainly claim similar sequences from many other genomes:

LOW Stringency

(Patent Application US20040249146: Nucleic acid compositions conferring altered visual phenotypes, DOW AGROSCIENCES)

Due to the complexity and variation in the claim language it is not possible to accomplish an exhaustive claim-by-claim analysis for all patents.  For this reason we have chosen what we believe to be a reasonable computational approximation of the problem. The choices made with respect to this new method of analysis were:

1. Sequence Database Used as the Query

The query sequences chosen were those DNA sequences that appear in the claims of both US patent applications and granted patents.

2.  Sequence Database Used as the Target

The target sequences for this analysis were the DNA sequences from all 5 Arabidopsis chromsomes.

3. Initial Sequence Comparison

Instead of megaBlast analysis, a method optimised for the fast discovery of almost identical sequences, we chose the Blastn algorithm for this analysis.  Blastn is suited to the recognition of more divergent sequences than megaBlast and we beleive that it is more appropriate for this analysis.

4. Filters and Variables Used For Blastn Analysis

To obtain the largest and most comprehensive dataset for analysis in subsequent steps, we set the Expect Cutoff value to be 1 x 10-20  for Blastn (compared to 1 x 10-200 for the previous analyses).  Furthermore, we retained only those sequence alignments that were >= 150bases in length for analysis (this is the same length used in the previous analyses). This database forms the core for future analyses based on similarity filtering, and can thus be re-anaylsed using various degrees of similarity (see Step 5 below).

5. Similarity Value

At this point we chose what we beleive to be a useful and reasonable definition of patent claims to "similar" sequences.  Thus, 85% identity was chosen as a reasonable value to use in the similarity filter for this analysis. This is based on our experience with patent documents claiming sequences based on sequence identity.  Technically, it is possible to re-filter our dataset using higher or lower valuse of % identity.

6. Discovery of Arabidopsis Sequences with >=85% Identity to the Query Sequence

The filter was applied and alignments with >=85% similarity were identified.  These identified sequences were then aligned over their full length with the corresponding Arabidopsis sequence.  Only those with a full-length alignment score of >= 85% identity were retained.

7. Mapping to the Arabidopsis Genome

Mapping was carried out as in the previous analysis, with matching sequences grouped into 300kb regions and then the totals for these regions mapped as a histogram onto the chromosomes.

Predicting the influence of cross genome claims from Arabidopsis gene patents on important dicot crops: Soybean versus Arabidopsis

Soya Bean (Soybean/Soya/Soy/Glycine max):

Soybean, or Glycine max, has a long history of domestication as a food source, and the seed has been harvested for food for many hundreds of years in Eastern Asia. This long history of domestication has resulted in the existence of a very large number of soybean variaties anbd cultivars. More recently, soybean has become an important global food crop, grown in Europe, Asia, Australia, Africa, and Northern and Southern America.  The plant itself is a legume, and will grow in many soil types, requireing a hot summer growing season. Being a legume, the plant establishes a symbiotic relationship with nitrogen-fixing bacteria and is able to grow in relatively nitrogen-poor soils.

Soybean is rich in both protein and oil.  Present day crops are grown for use mainly in processed foods, with only a relatively minor proportion being comsumed directly as food. Soybean and it's processed components finds its way into products such as breakfast cereals, oils, milk replacements, infant formulas, biodeisel, cosmetics, and animal feed.

Soybean and Biotechnology

Due to its economic importance and growing popularity in processed foods, G. max has been the focus of much research, including breeding studies and the production of GMOs (Genetically Modified Organisms).  As one important example of this:  Monsanto markets a Roundup-ready form of G. max which is resistant to glyphosate-based herbicides (e.g Monsanto's Roundup herbicides).  This modification offers a simple way for farmers to plant seed directly into unploughed fields.  Spraying with glyphosate-based herbicides eliminates competing weeds without the need for field ploughing prior to planting.

Patenting and genomics
The soybean genome is-as-yet in the early stages prior to genome sequencing.  The Purdue University consortium has started early work to develop the base for sequencing the genome of G. max.  Hence this genome is only in the earliest stages of sequencing.  However, there are presently 463,101 nucleotide and 2,615 protein sequences deposited in GenBank at the NCBI (as of 24th July 2006) for Glycine max.  Although this dataset is far less than that expected for the entire genome, it is possible to use the available sequence data to predict the amount of similarity between Glycine max and Arabidopsis thaliana genes and to determine the degree of cross genome claims in patents.

Glycine max has a genome size of 1100Mb, almost 10-times larger than Arabidopsis thaliana, with a diploid chromosome count of 40 (Genetics. 2005 Jul;170(3):1221-30.). It belongs to the pea family, Fabaceae, along with peas, beans, and other legumes.  We chose soybean as the Arbidopsis comparison for this analysis based on the following:

Summary

Goal:  Demonstrate to users through bioinformatics analysis and graphical presentation of data:

Date

DB and settings

Search Terms

Patent

Notes

20060602

PLens Stem-on Def DBs

Plant AND ((SEQ AND ID) in claims)

28,715

Too many

20060602

PLens Stem-on Def DBs

Dicot AND ((SEQ AND ID) in claims)

2,655

too many

20060602

PLens Stem-on Def DBs

Arabidopsis AND ((SEQ AND ID) in claims)

4,945

tm

20060602

PLens Stem-on Def DBs

(thale AND cress) 

143

20060602

PLens Stem-on Def DBs

(thale AND cress) AND arabidopsis

132

11 contain thale AND cress but not Arabidopsis!!!

20060602

PLens Stem-on Def DBs

thale

916

20060602

PLens Stem-on Def DBs

cress

2,149

20060602

PLens Stem-on Def DBs

(mouse AND ear AND cress)

192

20060602

PLens Stem-on Def DBs

(mouse AND ear AND cress) AND arabidopsis

160

32 contain mouse and ear and cress do not have Arabidopsis!!!

20060602

PLens Stem-on Def DBs

arbidopsis OR arabdopsis OR arabidpsis

135

common spelling mistakes?

20060602

PLens Stem-on Def DBs

((arbidopsis OR arabdopsis OR arabidpsis) in abstract) OR ((arbidopsis OR arabdopsis OR arabidpsis) in title)) 

2

in title and abstract (only apps.)

20060602

PLens Stem-on Def DBs

(arbidopsis OR arabdopsis OR arabidpsis) AND arabidopsis

128

7 docs have only the misspelled form!

20060607

PLens Stem-on Def DBs

Arabidopsis AND expressed AND (SEQ AND ID) AND sequence 

9,544

All 9 PG patents are top of the list (sorted via relevance). First 100 patents all seem to have Arabidopsis in title (good)

20060607

PLens Stem-on Def DBs

Arabidopsis AND expressed AND ((SEQ AND ID) in claims) AND sequence

3,984

As above, but "better" by reduction in hits(?)

20060607

PLens Stem-on Def DBs

Arabidopsis AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST

2,287

All PG Inc patents at top of list, less hits, but now many non-At patents in list (incl human)

20060607

PLens Stem-on Def DBs

Arabidopsis AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims) 

1,476

As above, now with Mendel's app showing, still has long list of human apps following though.

20060607

PLens Stem-on Def DBs

((Arabidopsis in abstract) OR (Arabidopsis in title)) AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims)

52

Bad - Mendel's app removed, no At in title or abstract! (Plant not At!)

20060607

PLens Stem-on Def DBs

(((Arabidopsis OR plant) in abstract) OR ((Arabidopsis OR plant) in title)) AND expressed AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims)

801

Good, PG and MBs present and some from Monsanto (not At though!)

20060607

PLens Stem-on Def DBs

(((Arabidopsis OR plant) in abstract) OR ((Arabidopsis OR plant) in title)) AND (expressed AND Arabidopsis) AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims)

699

Good, PG and MBs present, Monsanto gone from top 100.

20060607

PLens Stem-on Def DBs (filing date limit 1998 onwards)

(((Arabidopsis OR plant) in abstract) OR ((Arabidopsis OR plant) in title)) AND (expressed AND Arabidopsis) AND ((SEQ AND ID) in claims) AND sequence AND BLAST AND (polypeptide in claims)

657

Little improvement

20060608

PLens Stem-on Def DBs

(((plant OR dicot OR arabidopsis) in abstract) OR ((plant OR dicot OR arabidopsis) in title))

75,964

Huge!

20060608

PLens Stem-on Def DBs

(((plant OR dicot OR arabidopsis) in abstract) OR ((plant OR dicot OR arabidopsis) in title)) AND NOT ((monocot in abstract) OR (monocot in title))

75,761

Huge, little improvement!

PLens Stem-on Def DBs

PLens Stem-on Def DBs

Glossary

Expressed Sequence Tags (ESTs)[add a comment]

Summary

The availability of the Arabidopsis genome and gene sequences represent an opportunity to investigate the influence of such data on patenting, patenting opportunities, and patenting strategies of stakeholders. Such an investigation is useful, not only to those patenting related "inventions", but also to those who use the data, the model system, or who intend to use similar systems. Whilst other genomes, including those of bacteria (e.g. E. coli), invertebrates (e.g. C. elegans and Drosophila), and the human genome (Jensen and Murray, ref###) are also useful in this regard, the availability of an analysis for a popular plant genome offers many advantages for those in the plant sciences and/or agribusiness. Understanding what has happened in Arabidopsis with respect to patenting allows us to better understand:

Although most of the members of the AGI were not for profits or government institutions, funded through public money, companies such as Monsanto (throught Cereon Genomics Inc) and (###) Perkin Elmer Applied Biosystems (### CHECK THIS) did support their efforts.  Monsanto also sequenced the related Landsberg genome, and has supplied this data to the not-for-profit research community.  Hence the sequencing and sequence data that exists today for Arabidopsis was affectively acheived through collaboration between private and public organisations.  The end result of this collaboration, the genomic sequence, the ESTs, and the genetic markers are available to scientists throughout the world (either directly via public databases such as GenBank and dbEST, or under agreement with Monsanto for the Landsberg data).

However, a number of problematic issues exist with respect to Arabidopsis sequence data, particularly the data that was (and still might be) the focus of bulk sequence claims made by Paradigm Genetics Inc, and to a lesser extent those made by Mendel Biotechnology Inc. and Ceres Inc. Such companies have sought to protect their investment in Arabidopsis sequencing and research through patenting of large groups of genes.  The initial step in such a strategy has been to apply for patent protection over large groups of genes.  This "maximalist" strategy is not uncommon in patenting strategies focused towards other similar technologies, and arguably makes good business sense. Unfortuneately, once applications for large groups of sequences are made, this generates uncertainty for others in the field.

Patent Applications

Almost paradoxically, patent applications may have a similar or greater influence on downstream deliverability of products and services than patents themselves. Compared to granted patents, applications may have the following properties:

One drawback of the ability of applicants to make applications for many hundreds or thousands of genes and/or nucleic acid sequences is the uncertainty this generates in the minds of potential users of the technology. 

Uncertainty can take many forms, and this can result in researcher/strategists asking questions such as:

This uncertainty may form a barrier to use of the technology by others.  Additionally, as we have seen in previous chapters, the strategies used by some applicants seem to rely heavily on establishing an early priority date for bulk sequence applications and then supplying "inventiveness" and "enablement" at some much later date.  It is also possible for owners of the IPR to license technologies that are not-as-yet patented.

These properties of patent applications and the uncertainty generated by them produces a de facto protection for the technology under application.  Hence, although most of the claimed sequences present in bulk sequence applications will never produce a granted patent, they are in effect "protected" by the applicant and some aspects of the monopoly possible under a granted patent (de jure protection) is acquired by them.

Granted Patents

As we have seen in the case of bulk sequence applications, few sequences from applications have made their way into granted patents.  Possibly because applications themselves offer a form of protection that allows income to be derived from licensing without the need for a granted patent.  At this point granted patents appear to hold far less Arabidopsis sequences than patent applications.  This is almost certainly due to the economies involved in making and maintaining applications rather than generating, maintaining and defending granted patents.

Imagine the cost of producing granted patents for 1,800 transcription factors at ~US$20,000 each!
  Then compare with the economy of filing 2 or 3 new continuations each year.

It is unlikley that the originators of the US patent office envisaged that continuation and divisional applications would be used in this way,  since the original system was established essentially to allow monopoly protection of mechanical inventions and not genomic information. Arguably, granted patents offer a far higher degree of certainty for both applicant, licencees, and business than does a patent application.  However, the creation and maintenance  of uncertainty (either wittingly or unwittingly) provides in some cases a more economical business tool to monetise a technology.

Future for this Landscape

Charting an IP landscape surrounding any genome is a large undertaking.  To ensure that this landscape is useful to as many as possible and is functional as soon as possible, we have chosen to analyse briefly only a small part of the IP surrounding the Arabidopsis genome.  The choice to focus our analysis on only patent documents involving bulk sequence claims was made to ensure we could meet these initial goals.  As with any landscape, constant revision and additions are required to keep the cartography accurate and relevant.  With this in mind, we do not intend that this present version of the landscape is finished.  It is instead a first step towards a more detailed landscape. The intention has always been that this document should by dynamic and flexible, and that user comments would help shape future landscape efforts. [add a comment]

Towards this end, the landscape is marked throughout with user commentable regions.  We would be very interested to hear your comments about any of the regions of the landscape which you feel could be improved.  We are particularly interested to hear from people who have first hand experience in these matters (Arabidopsis research, patenting, licensing, and legal aspects).  User feedback will help us in directing future efforts to expand and update the landscape (and, if necessary, make corrections to the existing landscape). [add a comment]

Providing your input and help at an early stage of this project may very well create a landscape that provides yourself and users like you with a useful and valuable resource.[add a comment]

CAMBIA
29th June 2006[add a comment]

The information contained in this page was believed to be correct at the time it was collated. New patents and patent applications, altered status of patents, and case law may have resulted in changes in the landscape. CAMBIA makes no warranty that it is correct or up to date at this time and accepts no liability for any use that might be made of it. Corrections or updates to the information are welcome, please send an email to info@bios.net.