In short: AI tools are useful for legal research orientation: identifying relevant legal concepts, locating statutes, and understanding the general direction of case law. They are not reliable for citing specific cases. All current AI models hallucinate case references, including inventing case names, citations, court references, and holdings that do not exist. Every AI-generated citation must be verified through a primary source before use.
Legal research is one of the tasks where AI creates both genuine productivity gains and genuine professional risk. The productivity gain is real: a lawyer who knows the area of law can use AI to quickly identify relevant statutes, find the names of leading cases to then look up properly, and get an initial sense of whether a legal argument has a basis. The risk is equally real: AI tools fabricate specific case citations with the same confidence as accurate ones, and submitting hallucinated citations in legal proceedings has already resulted in court sanctions overseas.
The Hallucination Problem in Legal Research
AI fabricates case citations: In the 2023 US case Mata v. Avianca, a lawyer submitted an AI-generated brief containing multiple fictitious cases, including invented case names, courts, dates, and holdings. The court confirmed the cases did not exist. The lawyer was sanctioned. This was not a unique incident. All current large language models, including Claude, ChatGPT, and Gemini, exhibit this behaviour. The risk is not eliminated by using a more recent or more capable model.
The hallucination problem in legal research has a specific character: AI tools do not flag uncertainty when they fabricate. A hallucinated case citation reads identically to an accurate one. The model generates what a plausible citation would look like based on patterns in its training data. It does not know that the specific case does not exist.
This is a fundamental limitation of how current large language models work, not a quality issue with a specific tool. It affects Claude, ChatGPT, Gemini, and all other general-purpose AI models. Specialist legal AI tools with direct access to verified legal databases (such as Westlaw's AI features or LexisNexis's AI tools) handle this differently because they retrieve from a verified corpus rather than generating from memory. But even these tools can produce errors and require verification.
What AI Does Well in Legal Research
Used with appropriate verification, AI tools can genuinely speed up legal research in the following ways:
- Initial orientation in an unfamiliar area: Asking an AI to explain the general legal framework governing a particular issue is a reasonable starting point. The explanation of concepts and the structure of the law is often accurate at a general level, even when specific citations are not reliable.
- Identifying relevant statutes: AI tools are generally reliable at naming the correct piece of legislation governing a particular area, though you should always verify the current version and confirm it applies in the relevant jurisdiction. Legislation.gov.au and state equivalents are the authoritative sources.
- Understanding the general direction of case law: Asking whether courts have generally accepted or rejected a particular argument can be useful orientation, with the understanding that the AI is describing patterns from its training data and may be out of date or inaccurate on specifics.
- Drafting research questions: AI is useful for helping you articulate the legal issues you need to research, structure a research plan, or identify what you do not yet know.
- Summarising long documents: Summarising a lengthy judgment or legislative explanatory memorandum for initial comprehension is a legitimate use, as long as you read the original before relying on the summary for any material purpose.
What AI Gets Wrong in Legal Research
The tasks where AI most reliably fails in legal research:
- Specific case citations: Do not use AI-generated case citations without verifying each one through AustLII, Jade, Westlaw, or LexisNexis. The hallucination rate on specific citations is high enough that treating any AI-generated citation as unverified is the correct default.
- Recent developments: AI models have training data cutoffs. Cases decided, legislation passed, or regulatory guidance issued after the model's training cutoff will not be known to it. The model may also be unaware of recent changes to areas of law within its training period.
- Jurisdiction-specific Australian law: AI tools trained predominantly on US and UK legal material may apply incorrect principles when the question turns on Australian-specific legislation or case law. This is particularly relevant for state-specific laws, the Privacy Act, and Australian consumer law.
- The holding in a specific case: Even for cases the AI correctly identifies, it may misstate the ratio decidendi, confuse majority and dissenting opinions, or describe a case's significance inaccurately.
A Safe Verification Workflow
This workflow treats AI as a research starting point, not a research endpoint:
- Use AI for orientation only: Ask the AI to explain the legal framework, identify relevant statutes, and describe the general direction of case law. Treat this as a map, not as verified information.
- Extract any case names the AI mentions: List every case name the AI references, treating each as a research lead, not a confirmed citation.
- Verify every case through a primary source: Check each case name on AustLII (austlii.edu.au), Jade (jade.io), Westlaw, or LexisNexis before using it. If the case does not appear, do not use it. Assume it may be fabricated.
- Verify legislation references directly: Confirm section numbers and definitions on legislation.gov.au or the relevant state equivalent. AI frequently cites the correct Act but the wrong section.
- Update for currency: Confirm that the legislation is in its current form and that the cases cited have not been overruled. AI tools will not know about recent developments after their training cutoff.
- Do your own research from the verified starting points: Use the confirmed cases and legislation as the starting point for proper secondary research, not as the end product.
Tools Compared for Legal Research
Claude (Anthropic): Strong at reasoning through legal questions and explaining frameworks. 200,000-token context window handles long judgments and legislation. Same hallucination risk as other models on specific citations. Does not have access to legal databases.
ChatGPT (OpenAI): Widely used, familiar to most practitioners. Same hallucination risk on citations. GPT-4o with browsing enabled can retrieve some current information, but this is not a substitute for verified legal research databases.
Perplexity: Provides source citations with its answers, which makes it easier to check what it is drawing on. Useful for initial research orientation because the sources it cites are visible. Still requires verification; the cited source may not say what Perplexity claims.
Westlaw, LexisNexis, and Jade AI features: These retrieve from verified legal databases rather than generating from memory. They have different (lower) hallucination profiles for case law because they are retrieving from a confirmed corpus. If your practice subscribes to these platforms, use their AI features for case research rather than general-purpose models.
AustLII: Not an AI tool, but the authoritative free source for Australian case law and legislation. Use it to verify every AI-generated reference. austlii.edu.au.
Methodology (Real-World, Verified)
We test AI tools against real SMB workflows: the tasks a 20-person business actually uses AI for, not enterprise demos. Pricing is verified in AUD at the vendor's published rates or converted at current exchange rates. Compliance notes reference the legislation and regulatory guidance relevant to each article's scope. Tools are assessed for suitability by a business with no dedicated IT department.
Related reading: our can staff upload customer data to AI tools.
Can AI replace legal research databases like Westlaw or LexisNexis?
No. General-purpose AI tools like Claude and ChatGPT generate citations from patterns in their training data and cannot verify that the cases they cite exist or that their descriptions of those cases are accurate. Legal research databases like Westlaw, LexisNexis, and Jade retrieve from verified corpora of actual judgments and legislation. AI tools are useful as a research starting point; they do not replace verified primary source research.
What is AI hallucination and why is it a problem in legal research?
AI hallucination occurs when a language model generates plausible-sounding content that is factually incorrect. In legal research, this manifests as fabricated case citations: the AI produces a case name, year, court reference, and summary of the holding that do not correspond to any real case. The model does this with the same confidence as when it accurately describes a real case. The risk is that a researcher who does not verify the citation uses it in a submission or advice, as occurred in the 2023 US Mata v. Avianca matter.
Which AI tool is most reliable for legal research?
All general-purpose AI tools carry the same fundamental hallucination risk on specific case citations. If your firm subscribes to Westlaw, LexisNexis, or Jade, their built-in AI features are generally more reliable for case research because they retrieve from a verified legal corpus rather than generating from memory. Among general-purpose tools, Perplexity is useful because it displays its source citations, making it easier to check what it is drawing on. For Australian case law, AustLII (austlii.edu.au) and Jade (jade.io) are the authoritative free verification sources.
Have any Australian courts issued guidance on AI use in legal proceedings?
Some Australian courts are beginning to issue practice notes and directions on AI use in proceedings and submissions. The position is evolving. Check the practice directions and any standing orders of the specific court before using AI-assisted material in proceedings. The Law Council of Australia is also developing guidance on AI use in legal practice: lawcouncil.asn.au.
How do I verify an AI-generated case citation in Australia?
Search for the case name on AustLII (austlii.edu.au) or Jade (jade.io). If the case does not appear, treat it as fabricated and do not use it. If it does appear, read the judgment directly to confirm that the AI's description of the holding is accurate. Do not rely on the AI's summary of a case, even where the case itself exists. Also confirm the case has not been overruled or distinguished by subsequent decisions.
<p>Looking for a full comparison of which AI tools Australian law firms can use safely, including data handling, pricing, and privilege considerations? Our main guide covers everything in one place.</p>
Read: Best AI Tools for Australian Lawyers