Genomics in the public domain: strategy and  policy

Nature Reviews Genetics 1, 70-74 (2000)

SCIENCE AND SOCIETY

URL: http://www.nature.com/cgi-taf/DynaPage.taf?file=/nrg/journal/v1/n1/full/nrg1000_070a_fs.html

 Rebecca S. Eisenberg   

 Rebecca S. Eisenberg is the Robert & Barbara Luciano Professor of Law at the University of   Michigan Law School, Ann Arbor, Michigan 48109, USA. rse@umich.edu.

 

The public domain has been conspicuous in media accounts of public

  and private sector initiatives to complete the sequence of the human

  genome. The issue of whether the human genome will be freely

  available to the public or privately held as a proprietary resource has

  captured the attention of the scientific, trade and popular press, the

  financial markets, and even heads of state. Although some media

  commentary has framed the issue as a conflict between ethics and

  greed, strategic considerations go a long way towards explaining the

  timing and quality of information disclosures on both sides of the

  public–private divide.

 

  Some descriptions of the relationship between Celera Genomics

  Corporation and the Human Genome Project have painted a

  black-and-white picture of a private firm racing to profit from patents while

  the publicly funded project struggles to keep the genome in the public

  domain. In fact, both sides of the picture are variegated. Even as it has

  built a proprietary database and filed patent applications, Celera has

  repeatedly promised that it will eventually make the raw sequence of the

  human genome available to scientists free of charge1, although the timing

  and details of this commitment are unclear and seem to have shifted. At

  the same time, although the public sponsors of the Human Genome

  Project have consistently affirmed the importance of prompt and free

  public access to raw genomic sequence data ( Box 1), the United States

  government reportedly holds more patents on DNA sequences than any

  private firm2. Public and private strategies for publication and patenting

  have overlapped throughout the brief history of genomics research3. An

  important factor contributing to this convergence has been the policy of

  the United States government, beginning with the passage of the

  Bayh–Dole Act of 1980, to promote the patenting of

  government-sponsored research results4. This policy has added strategic

  complexity to decisions concerning public disclosure of research results, as

  both academic and private institutions have become increasingly mindful

  of the interaction between the public domain and the patent system.

  Meanwhile, the increasing scientific significance of private sector research

  accomplishments has underscored the importance of timely publication of

  research results in both the public and private sectors.

 

  What motivates these different players

  to disclose information to the public? How

  do the missions and priorities of different

  institutions in the public and private

  sectors affect the timing and quality of

  their disclosures?

 

  Reasons for disclosing research results

 

  Scientific recognition and credibility. A

  perennial motivation for publication of

  new research results is to stake claim to

  scientific achievements, thereby

  triggering recognition. Public disclosure

  subjects research results to scrutiny,

  exposing errors and promoting

  confidence in the validity of the results. It

  also establishes a priority date for

  purposes of scientific recognition. These considerations are particularly

  important for controversial research claims and for disputed claims of

  scientific priority in a close race.

 

  The goal of scientific recognition undoubtedly motivates both academic

  and private sector researchers involved in sequencing the human

  genome. The sponsors of these research efforts depend on access to top

  scientific talent and cannot afford to ignore the motivations of scientists to

  achieve recognition in the scientific community. Rivalry for scientific

  recognition has been aggravated in this particular context by past

  statements from each side that the other is pursuing a scientific strategy

  that will not permit satisfactory completion of the job5, 6.

 

  This may be an important reason why Celera, although hoping to profit

  from selling access to proprietary DNA sequence databases, has

  nonetheless consistently promised to make the raw sequence of the

  human genome freely available. Without such public disclosure its claims

  to priority will be impossible for the scientific community to assess.

  Researchers have expressed scepticism in the past about the claimed

  accomplishments of private DNA sequencing firms that do not make their

  data publicly available7. The scientific community gives more credence to

  claims backed by publicly accessible data than to claims backed only by

  press release and rumour.

 

  Scientific credibility is also cited by members of the SNP Consortium (Box

  2) as a motivation for making single nucleotide polymorphisms (SNPs)

  publicly available. The pharmaceutical firms in the SNP Consortium hope to

  use SNPs as pharmacogenomic markers to develop drugs for which safety

  and efficacy depend on genotype. Regulatory approval for such products

  is likely to depend on the reliability of genetic screening tests that predict

  drug responses of particular patients. Consortium members hope that it

  will be easier to win approval if the tests use markers that are in the

  public domain, and are therefore subject to challenge and validation by

  the scientific community8.

 

  Publication to the scientific community is inconsistent with long-term

  secrecy, but it is not inconsistent with patenting. Patent disclosures are

  made freely available to the public once the patent issues under United

  States law, and pending patent applications are made public 18 months

  after their filing dates in most of the world. Even before public disclosure

  through the patent system, institutions may permit scientists to publish

  while still preserving patent rights if they coordinate the timing of

  publication with the filing of patent applications. In most of the world,

  disclosure of an invention in a publication before patent filing results in a

  forfeiture of patent rights9, but United States law permits the filing of a

  patent application up to one year after publication10. So publication of

  research results is no guarantee that the results are free of intellectual

  property claims, nor is forfeiture of patent rights a condition for scientific

  recognition.

 

  Widespread dissemination and use. Apart from concerns about

  recognition and credibility of research claims, some institutions might

  choose to disclose DNA sequence information in publicly available

  databases in order to promote widespread dissemination and use. Free

  access is particularly important for encouraging users with limited financial

  resources (such as academic researchers), who might otherwise be

  unable to gain access to past discoveries for use in future research. In

  addition to eliminating licence fees, free availability minimizes transaction

  costs by eliminating the need for owners and users to find each other and

  negotiate licences.

 

  Apart from advancing the public interest in promoting future research, free

  access might also advance the financial interests of its champions. For

  example, the public sponsors of the Human Genome Project are also likely

  to sponsor future research that makes use of genomic information. If the

  information is held in proprietary databases or can only be used under the

  terms of licence agreements, these research sponsors might expect to

  pay more in the future than they would if the information were in the

  public domain.

 

  Some private research sponsors might also believe that it serves their

  financial interests to promote widespread access to DNA sequence

  information by putting it in the public domain. Life sciences firms that hope

  to profit from developing and selling new products, such as drugs and

  crop seeds, might expect to earn more profits sooner by accelerating

  progress in fundamental biological research, thereby bringing new

  commercial products into view. Rather than trying to do this fundamental

  research themselves — an expensive job at which they have no

  comparative advantage — they might prefer to let academic scientists do

  the research with public funds. To the extent that free access facilitates

  pre-market research in universities, these firms may find their interests

  aligned with the interests of public research sponsors in promoting free

  disclosure of DNA sequence information in the public domain. This may

  explain why Monsanto recently decided to release a rough draft of the rice

  genome in the public domain11, and why Merck was willing to invest in

  university-based research to generate a database of expressed sequence

  tags (ESTs) for the public domain12.

 

  Defeating potential patent claims. Another consideration that seems to

  be motivating public disclosures of genomic information by some

  institutions in both the public and private sectors is a wish to prevent

  patenting of DNA sequences. This seems to be one factor driving the

  requirement that publicly funded investigators deposit all newly identified

  DNA sequences and mutations in the publicly accessible GenBank

  database within 24 hours under the 'Bermuda rules' (Box 3). This

  accelerated timetable, which makes it difficult for grantees to get patent

  applications on file before public disclosure, also leads to the prompt

  creation of 'prior art' that could defeat potential patent claims of others. A

  subsequent inventor cannot patent something that was already publicly

  disclosed before the patent claimant discovered it13.

 

  Of course, the requirement for prompt deposit in the public domain could

  be justified as a way of giving the scientific community the benefit of free

  access to as much sequence information as possible as quickly as

  possible, without invoking an anti-patent motivation. But the Bermuda

  rules are not the only evidence of an anti-patenting norm for raw DNA

  sequence information within the Human Genome Project. The National

  Human Genome Research Institute has explicitly discouraged grantees

  from pursuing such patents ( Box 4).

 

  The creation of prior art may prevent the issuance of patents not only on

  past discoveries that are publicly disclosed, but also on future discoveries

  that become obvious in light of past disclosures14. This raises the

  possibility of publishing early research results as a strategic move to

  pre-empt the patenting of future discoveries by commercial rivals15, 16. It

  might even be possible for a firm that is lagging in a race to forestall the

  patent claims of a swifter rival through publications that enrich the prior

  art enough to limit what may be patented in the future. This possibility

  might explain why it is often the laggards rather than the leaders in DNA

  sequencing races that sing the praises of the public domain. So, for

  example, when Merck decided to sponsor the Merck Genome Initiative, at

  least two private firms already had a significant lead over Merck in

  generating private databases of ESTs17. By putting ESTs in the public

  domain, Merck may have hoped to create prior art that would defeat

  future patent claims by these or other firms to the corresponding

  full-length genes, although it now seems unlikely that this strategy will

  prove successful18, 19. The creation of patent-defeating prior art is an

  acknowledged part of the strategy of the SNP Consortium. Again, the SNP

  Consortium entered the race late, after numerous other private sector

  SNP discovery efforts were well under way20. Under these circumstances,

  patent-defeating publication may have seemed like the best hope for

  Consortium members to preserve future access to information that would

  otherwise become proprietary. But if the patent-defeating goal dominates

  the goal of prompt dissemination of information, prompt publication in the

  public domain may not be the best way to proceed.

 

  In fact the SNP Consortium , in contrast to the Merck Genome Initiative

  and participants in the Human Genome Project that comply with the

  Bermuda rules, does not publish all of its information as quickly as

  possible. Instead, it uses a patent law device called a Statutory Invention

  Registration (SIR) to create prior art while delaying publication ( Box 2).

 

  The mechanism for creating prior art before public release is codified at §

  157 of the United States Patent Act. This provision authorizes the United

  States Patent and Trademark Office (PTO) to publish a patent application

  that has been converted to a SIR, without examining it for patentability, if

  the applicant waives the right to receive a patent on the invention within

  a specified period of time. A SIR has the 'attributes specified for patents',

  but does not include the right to exclude others from making, using,

  selling or importing the invention21. One attribute of a patent is that it is

  effective as prior art for purposes of defeating the patent claims of other

  applicants as of its filing date22, even though it might not be published for

  some time thereafter. It is therefore possible to file a patent application

  describing a discovery, wait a while before converting it to a publicly

  accessible SIR, and have the SIR count as prior art as of its filing date,

  even though the disclosure was not yet published on that date. Through

  the use of this device, the SNP Consortium hopes to create prior art that

  will prevent subsequent inventors from patenting their newly identified

  SNPs, while deferring disclosure of the SNPs until after they have been

  mapped. This strategy for prior art creation combines the benefits of

  disclosure with the benefits of nondisclosure.

 

  Reasons for withholding results

 

  Institutions in both the public and private sectors may have compelling

  reasons for withholding research results from disclosure. How these

  reasons are balanced against the reasons for making disclosure will vary

  depending on the priorities of the institution.

 

  Retaining exclusive access for customers. An obvious reason to

  withhold commercially valuable information from public disclosure is to

  preserve its value for sale to paying customers. Nobody wants to pay for

  something that they can get free. Firms like Incyte Genomics and Human

  Genome Sciences that seek to profit from selling access to proprietary

  databases are therefore understandably reluctant to give the same

  information away free in public databases. Why, then, is Celera promising

  to make the raw sequence of the human genome freely available?

  Sometimes limited disclosure of information in the public domain is

  consistent with selling a proprietary information product that offers further

  value over the public domain version23. Celera's paying customers gain

  access to sequence data before the public-release version becomes

  available, as well as getting annotations and proprietary bioinformatics

  capabilities that will not be released in the public domain.

 

  Avoiding disclosure to rivals. Another reason to withhold information

  from publication is that public disclosure lets your rivals know exactly what

  you have accomplished and gives them the benefit of what you have

  learned so far. This seems to be a concern for both public and private

  sector researchers involved in sequencing the human genome. Celera has

  cited concern that competitors will repackage their data and sell it in

  competition with them to justify restrictions on use of the version of the

  human genome sequence that they promise to make available free of

  charge24. For their part, some participants in the public sector Human

  Genome Project feel aggrieved by Celera's inclusion of data that they

  themselves deposited in GenBank in its claim to have completed the

  sequence of the human genome. Some scientists are particularly

  indignant that Celera's publications might include data deposited in

  GenBank by academic investigators who would not be included as

  co-authors25.

 

  In a race to accumulate information, everything one discloses in the public

  domain becomes available to one's rivals and helps them get ahead. If

  one side makes its data freely available and the other side keeps its data

  secret, the rival that relies on secrecy will always know at least as much

  as the rival that promptly discloses its data.

 

  Sometimes information disclosures will be of more value to the secretive

  rival than to the disclosing rival because of the cumulative value of

  combining public data with private data. Suppose, for example, that two

  rivals, Public University and Private Company, each sequence different

  portions of the same gene. Suppose further that the patent system offers

  more generous protection for full-length genes than for gene fragments.

  (Although the matter is not free from doubt, this seems to be consistent

  with the position of the PTO as reflected in recently disclosed training

  materials for patent examiners in applying the written description and

  utility guidelines.)

 

  If Public University freely discloses its portion of the gene in GenBank,

  Private Company might add that information to the partial sequence it

  already has, quickly complete the full-length sequence, and file a patent

  application that it would not have been in a position to file without the

  Public University disclosure. So prompt disclosure in the public domain can

  be treacherous if your ultimate goal is to keep information freely available.

  Although disclosure creates potentially patent-defeating prior art, it may

  also enhance the value of complementary private information, and

  perhaps even make it easier for rivals to get patents.

 

  This may explain why the SNP Consortium defers disclosure of its newly

  identified SNPs by filing SIRs instead of publishing. As well as delaying

  additions to the proprietary SNP collections of their rivals, the

  deferred-disclosure strategy allows the Consortium to conceal from its

  rivals just what it has accomplished so far, creating uncertainty as to

  which SNPs are worth patenting and which are already in the prior art.

  However, the SIR strategy does not make as much information available

  to the research community as quickly as prompt publication or posting on

  a website would do, leaving those who need prompt access to SNPs with

  nowhere to turn but proprietary collections. But as the SIR strategy

  promises eventual disclosure, those with less urgent needs may be

  content to wait, knowing that the Consortium's SNPs will soon be freely

  available.

 

  If the goal of promoting immediate public access dominates the goal of

  defeating future patent claims, then publication might make more sense

  than patent filing. This may be one reason why the sponsors of the

  Human Genome Project, whose overarching mission is to promote

  research, call on their grantees to deposit their data promptly in GenBank

  in accordance with the Bermuda rules rather than to file patent

  applications and defer disclosure pending conversion to SIRs. But the

  public sector Human Genome Project has paid a price for this policy — it

  has advanced the competitive position of their private sector rivals in the

  race to complete the sequence of the human genome, and may have

  enhanced their patent positions.

 

  Preserving patent rights. A final reason for deferring disclosure of DNA

  sequence information is to preserve the possibility of obtaining viable

  patent rights in the future. This concern may motivate some institutions to

  defer publication in precisely the circumstances that it motivates other

  institutions to make prompt disclosure. The difference depends on

  whether they believe that pre-empting future patents is good or bad.

  Apart from concern about preserving their own patent rights, public

  research sponsors and publicly funded research performers may worry

  that premature public disclosure could prevent them from complying with

  their mandate under the Bayh–Dole Act to promote technology transfer

  and product development by patenting research results. Indeed, this

  concern was cited by former NIH director Bernadine Healy in support of

  the decision to file patent applications on the first ESTs identified by Craig

  Venter when he was at NIH25.

 

  In fact, it does not seem that publication of raw genomic DNA sequence

  will prevent the issuance of patents on genes that are subsequently

  found to lie within that sequence under United States law. The situation in

  Europe is less certain and awaits clarification of national laws in response

  to a 1998 directive of the European Parliament on the legal protection of

  biotechnological inventions (Box 5). Although the patent system has not

  yet resolved many of the legal issues that will determine what portions of

  the human genome may be patented, for the time being there seems to

  be little threat that disclosure of the human genome in the public domain

  will leave future researchers who identify and characterize genes with

  nothing left to patent.

 

  Conclusion

 

  Complex and interrelated strategies for endowing the public domain are

  at work in the field of genomics. These strategies arise out of the varied

  plans of different institutions for extracting value out of genomic

  information, complicated by the interplay of the public domain with the

  patent system. Public disclosure of genomic information advances some

  interests while harming others, with no simple distinction between the

  interests of public and private institutions. Understanding these interests

  might do more to enlighten public policy debates about the importance of

  the public domain in genomics research than appeals to ethical

  imperatives.

 

  Boxes

 

 

 

 

 

  Box 1 | Bill Clinton and Tony Blair on genome data access

 

 

 

 

  United States President, Bill Clinton and United

  Kingdom Prime Minister, Tony Blair issued the

  following statement concerning access to

  genome sequence data, on the 14 March 2000.

 

  "To realize the full promise of this research, raw

  fundamental data on the human genome,

  including the human DNA sequence and its

  variations, should be made freely available to

  scientists everywhere.

 

  Unencumbered access to this information will promote discoveries that

  will reduce the burden of disease, improve health around the world, and

  enhance the quality of life for all humankind. Intellectual property

  protection for gene-based inventions will also play an important role in

  stimulating the development of important new health care products.

 

  We applaud the decision by scientists working on the Human Genome

  Project to release raw fundamental information about the human DNA

  sequence and its variants rapidly into the public domain, and we

  commend other scientists around the world to adopt this policy."

 

  (Photo by Mark Wilson, Newsmakers.)

 

 

 

 

 

 

  Box 2 | The SNP Consortium

 

 

 

 

  Members of the SNP Consortium include the Wellcome Trust, APBiotech,

  AstraZeneca PLC, Aventis, Bayer AG, Bristol-Meyers Squibb Company, F.

  Hoffmann-LaRoche, Glaxo Wellcome PLC, IBM, Motorola, Novartis, Pfizer

  Inc., Searle and SmithKline Beecham PLC.

 

  "The SNP Consortium , Ltd. (TSC) has been formed to advance the field of

  medicine and the development of genetic based diagnostics and

  therapeutics, through the creation of a high density single nucleotide

  polymorphism (SNP) map of the human genome. This map will be freely

  available to all parties (members and non-members) at the same time."

 

  The SNP Consortium candidly describes its intellectual property strategy

  as follows:

 

  "The overall IP objective is to maximize the number of SNPs the [ sic] (1)

  enter the public domain at the earliest possible date, and (2) to be free

  of third-party encumbrances such that the map can be used by all

  without financial or other IP obligations. To meet objective (2), the [SNP

  Consortium] intends to withhold public release of identified SNPs until

  mapping has been achieved to prevent facilitating the patenting of the

  same SNPs by third parties. Mapped SNPs will be publicly released

  quarterly, approximately one quarter after they are identified. The

  intellectual property plan is intended to maintain the priority dates of

  discovery of the unmapped SNPs during the period between identification

  and release, for use as 'prior art'."

 

 

 

 

 

 

  Box 3 | The Bermuda rules

 

 

 

 

  The Bermuda rules derive their name from an agreement entered into at

  the International Strategy Meeting on Human Genome Sequencing held

  in Bermuda on the 25–28 February 1996, sponsored by the Wellcome

  Trust26.

 

  "It was agreed that all human genomic sequence information, generated

  by centres funded for large-scale human sequencing, should be freely

  available and in the public domain in order to encourage research and

  development and to maximise its benefit to society."

 

  The Bermuda rules have been criticized for promoting public disclosure of

  data that have not been checked for accuracy27.

 

 

 

 

 

 

  Box 4 | NHGRI policy on release of human genome sequence data

 

 

 

 

  The National Human Genome Research Institute issued the following

  policy statement on the 7 March 1997.

 

  "In NHGRI's opinion, raw human genomic DNA sequence, in the absence

  of additional demonstrated biological information, lacks demonstrated

  specific utility and therefore is an inappropriate material for patent filing.

  NIH is concerned that patent applications on large blocks of primary

  human genomic DNA sequence could have a chilling effect on the

  development of future inventions of useful products...NHGRI will monitor

  grantee activity in this area to learn whether or not attempts are being

  made to patent large blocks of primary human genomic DNA sequence."

 

 

 

 

 

 

  Box 5 | European Parliament directive on patenting

 

 

 

 

  Directive 98/44/EC of the European Parliament and of the Council of the

  6 July 1998 on the legal protection of biotechnological inventions, Official

  Journal L 213, 30/07/1998 p. 0013–0021 Article 5:

 

       The human body, at the various stages of its formation and

       development, and the simple discovery of one of its elements,

       including the sequence or partial sequence of a gene, cannot

       constitute patentable inventions.

       An element isolated from the human body or otherwise produced

       by means of a technical process, including the sequence or partial

       sequence of a gene, may constitute a patentable invention, even if

       the structure of that element is identical to that of a natural

       element.

       The industrial application of a sequence or a partial sequence of a

       gene must be disclosed in the patent application.

 

 

  Links

 

  COMPANIES

  Celera | Monsanto | Merck | Incyte | Human Genome Sciences

 

  FURTHER INFORMATION

  Human Genome Project | Joint statement by Bill Clinton and Tony Blair |

  The SNP Consortium | The Bermuda rules | National Human Genome

  Research Institute policy on patenting of human genomic sequence |

  Interim utility guidelines for Patent Examiners | Written description

  guidelines for Patent Examiners | European Parliament directive on

  patenting

 

 

 

 

 

  References

 

  1. 

     Venter, J. C. et al. Shotgun sequencing of the human genome. Science

     280, 1540–1542 ( 1998). PubMed

  2. 

     King, R. T. Jr Code green: Gene quest will bring glory to some; Incyte

     will stick with cash . The Wall Street Journal 10 February (2000 ).

  3. 

     Healy, B. Special report on gene patenting. N. Engl. J. Med. 327,

     664–667 (1992).  PubMed

  4. 

     Eisenberg, R. S. Public research and private development. Virginia Law

     Rev. 82, 1663–1727 (1996).

  5. 

     Waterston, R. & Sulston, J. E. The Human Genome Project: Reaching

     the finish line. Science 287, 53– 54 (1998).

  6. 

     Marshall, E. Claim and counterclaim on the human genome. Science

     288, 242–243 (2000).  PubMed

  7. 

     Nowak, R. The gold bug: Helicobacter pylori; claimed to be the first

     free-living organism genome fully sequenced. Science 267, 173–174

     (1995).

  8. 

     Wade, N. 10 Drug makers join in drive to find diseases' genetic roots.

     The New York Times 15 April (1999).

  9. 

     European Patent Convention, Article 54.

  10. 

     US Code, title 35, § 102(b).

  11. 

     Palevitz, B. A. Rice genome gets a boost: private sequencing effort

     yields rough draft for the public. The Scientist 1 May (2000).

  12. 

     Eisenberg R. S. Intellectual property issues in genomics. Trends

     Biotechnol. 14, 302–307 (1996).  PubMed

  13. 

     US Code, title 35, § 102(a).

  14. 

     US Code, title 35, § 103.

  15. 

     Parchomovsky, G. Publish or perish. Michigan Law Rev. 98, 926–952

     (2000).

  16. 

     Lichtman, D., Baker, S. & Kraus, K. Strategic disclosure in the patent

     system. Vanderbilt Law Rev. (in the press).

  17. 

     Bishop, J. E. Plan may blow lid off secret gene research. Wall Street

     Journal 28 September (1994).

  18. 

     In re Bell, 991 Federal Reporter, 2d series 781 (Court of Appeals for the

     Federal Circuit, 1993).

  19. 

     In re Deuel, 51 Federal Reporter, 3d series 1552 (Court of Appeals for

     the Federal Circuit, 1995).

  20. 

     Marshall, E. Drug firms to create public database of genetic mutations.

     Science 284, 406–407 ( 1999). PubMed

  21. 

     US Code, title 35, § 157(c).

  22. 

     US Code, title 35 § 102(e).

  23. 

     Shapiro, C. & Varian, H. R. Information Rules: A Strategic Guide to the

     Network Economy (Harvard Business School Press, Cambridge,

     Massachusetts, 1998).

  24. 

     Venter, J. C. Clinton and Blair shouldn't destroy our research. Wall

     Street Journal 21 March (2000).

  25. 

     Marshall, E. Talks of public–private deal end in acrimony. Science 287,

     1723–1725 (2000).  PubMed

  26. 

     Bentley, D. R. Genomic sequence data should be released

     immediately and freely in the public domain. Science 274, 533– 534

     (1996). PubMed

  27. 

     Adams, M. D. & Venter, C. J. Should non-peer-reviewed raw DNA

     sequence data be forced on the scientific community? Science 274,

     534–536 (1996).  PubMed

 

  Acknowledgements

 This research has been supported by a grant from the United States Department of Energy.