Data Science

735 Results

number of items to display at a time

Research

Apr 28, 2026

Evaluating Large Language Models' Abilities to Process and Understand Technical Policy Reports

The authors detail their development of a specialized benchmark for evaluating large language models' abilities to process and understand technical policy reports, thus addressing a gap in existing domain-specific evaluation.
Research

Apr 22, 2026

Simpler Is Better for Autograders: Toward Cost-Effective LLM Evaluations for Open-Ended Tasks

A simple rubric-based autograder outperformed more-complex large language model grading methods across benchmarks, often matching or beating nonexpert human graders while cutting down evaluation time and cost.
Research

Mar 31, 2026

A Selection of Implementable Actions to Establish an Air Force Workforce Analytics Center of Excellence

The authors supported the efforts of the Deputy Chief of Staff for Manpower, Personnel, and Resources (AF/A1) to strengthen the workforce analytics enterprise’s contributions into Headquarters Air Force decisionmaking processes.
Research

Mar 30, 2026

Choosing an Analytic Approach: Key Study Design Considerations in State Policy Evaluation

This paper reviews and details methods for state policy evaluation to guide selection of a research approach, based on an evaluation’s setting and available data.
Research

Mar 24, 2026

Artificial General Intelligence Forecasting and Scenario Analysis: State of the Field, Methodological Gaps, and Strategic Implications

The authors synthesize diverse artificial general intelligence forecasting methodologies to help decisionmakers navigate uncertainty about both the timing and nature of advanced artificial intelligence capabilities.
Research Summary

Mar 19, 2026

Unlocking the Tax Code with RAND's Tax Code Analysis Tool

To disentangle the U.S. tax code and help policymakers better understand the effects of proposed tax changes, researchers developed the RAND Tax Code Analysis Tool (CAT). This brief describes current CAT analyses and potential future capabilities.
Tool

Mar 19, 2026

Tax Code Analysis Tool 1.0: Applying Machine Learning to Map the Tax Code

The authors describe a tool to analyze the U.S. tax code by building a graph database of legal text. The Tax Code Analysis Tool consists of a comprehensive graph database mapping Title 26 of the U.S. Code on internal revenue.
Commentary

Mar 5, 2026

Five Ways Quantum Technology Could Shape Everyday Life

Quantum computing has the potential to shape the future. In what areas might this emerging technology have tangible impacts?
Tool

Mar 4, 2026

Tactical Will to Fight Assessment Guide: A How-To Manual for Conducting Tactical Will to Fight Assessments

The objective of this tool is to improve will to fight (W2F) analysis and assessments. The tool gives analysts (information, intelligence, or otherwise) a structured analytic approach for assessing adversary or partner W2F at the tactical level.
Tool

Mar 4, 2026

Polling Rank: A Comparative Judgment Tool

This guide describes a RAND-developed comparative judgment software tool called Polling Rank, which takes a list of text items as an input and outputs a ranking of those items based on pairwise comparisons made by human participants.
Podcast

Feb 27, 2026

Duration 34:17

Breaking Down the Federal Budget: New Tools from RAND

Economist Jeffrey Wenger explains how the RAND Budget Model works, ways it can be used to gain new insights into the federal budget, and why it matters to all Americans.
Tool

Feb 23, 2026

Judge Reliability Harness

RAND researchers developed the Judge Reliability Harness, an open-source library that orchestrates standardized, reproducible evaluations of large language model–based judges through systematic perturbation testing and human-in-the-loop validation.

First
Previous
Page 1 of 62
Next
Last

Data Science

Data Science

Related Topics

Breaking Down the Federal Budget: New Tools from RAND

AI Security Guide and Risk Assessment Tool

Explore RAND’s Work on this Topic

Error

Evaluating Large Language Models' Abilities to Process and Understand Technical Policy Reports

Simpler Is Better for Autograders: Toward Cost-Effective LLM Evaluations for Open-Ended Tasks

A Selection of Implementable Actions to Establish an Air Force Workforce Analytics Center of Excellence

Choosing an Analytic Approach: Key Study Design Considerations in State Policy Evaluation

Artificial General Intelligence Forecasting and Scenario Analysis: State of the Field, Methodological Gaps, and Strategic Implications

Unlocking the Tax Code with RAND's Tax Code Analysis Tool

Tax Code Analysis Tool 1.0: Applying Machine Learning to Map the Tax Code

Five Ways Quantum Technology Could Shape Everyday Life

Tactical Will to Fight Assessment Guide: A How-To Manual for Conducting Tactical Will to Fight Assessments

Polling Rank: A Comparative Judgment Tool

Breaking Down the Federal Budget: New Tools from RAND

Judge Reliability Harness

Oops! An unexpected error has occurred.

RAND Headquarters

U.S. research divisions

International research divisions

Data Science

Data Science

Related Topics

Breaking Down the Federal Budget: New Tools from RAND

AI Security Guide and Risk Assessment Tool

Explore RAND’s Work on this Topic

Error

Evaluating Large Language Models' Abilities to Process and Understand Technical Policy Reports

Simpler Is Better for Autograders: Toward Cost-Effective LLM Evaluations for Open-Ended Tasks

A Selection of Implementable Actions to Establish an Air Force Workforce Analytics Center of Excellence

Choosing an Analytic Approach: Key Study Design Considerations in State Policy Evaluation

Artificial General Intelligence Forecasting and Scenario Analysis: State of the Field, Methodological Gaps, and Strategic Implications

Unlocking the Tax Code with RAND's Tax Code Analysis Tool

Tax Code Analysis Tool 1.0: Applying Machine Learning to Map the Tax Code

Five Ways Quantum Technology Could Shape Everyday Life

Tactical Will to Fight Assessment Guide: A How-To Manual for Conducting Tactical Will to Fight Assessments

Polling Rank: A Comparative Judgment Tool

Breaking Down the Federal Budget: New Tools from RAND

Judge Reliability Harness

Oops! An unexpected error has occurred.