Large Language Model Standards


June 8, 2026
mike@standardsmichigan.com


Perhaps the World Ends Here | Joy Harjo

 

The world begins at a kitchen table. No matter what, we must eat to live.
The gifts of earth are brought and prepared, set on the table.
So it has been since creation, and it will go on.
We chase chickens or dogs away from it. Babies teethe at the corners. They scrape their knees under it.
It is here that children are given instructions on what it means to be human.
We make men at it, we make women.
At this table we gossip, recall enemies and the ghosts of lovers.
Our dreams drink coffee with us as they put their arms around our children.
They laugh with us at our poor falling-down selves and as we put ourselves back together once again at the table.
This table has been a house in the rain, an umbrella in the sun.
Wars have begun and ended at this table. It is a place to hide in the shadow of terror.
A place to celebrate the terrible victory.
We have given birth on this table, and have prepared our parents for burial here.
At this table we sing with joy, with sorrow. We pray of suffering and remorse. We give thanks.
Perhaps the world will end at the kitchen table, while we are laughing and crying, eating of the last sweet bite.

 

The following are among the most commonly used standards and benchmarks for evaluating large language models (LLMs):

  1. GLUE (General Language Understanding Evaluation): GLUE is a benchmark designed to evaluate and analyze the performance of models across a diverse range of natural language understanding tasks, such as text classification, sentiment analysis, and question answering.
  2. SuperGLUE: SuperGLUE is an extension of the GLUE benchmark, featuring more difficult language understanding tasks, aiming to provide a more challenging evaluation for models.
  3. CoNLL (Conference on Computational Natural Language Learning): CoNLL has historically hosted shared tasks, including tasks related to coreference resolution, dependency parsing, and other syntactic and semantic tasks.
  4. SQuAD (Stanford Question Answering Dataset): SQuAD is a benchmark dataset for evaluating the performance of question answering systems. It consists of questions posed on a set of Wikipedia articles, where the model is tasked with providing answers based on the provided context.
  5. RACE (Reading Comprehension from Examinations): RACE is a dataset designed to evaluate reading comprehension models. It consists of English exam-style reading comprehension passages and accompanying multiple-choice questions.
  6. WMT (Workshop on Machine Translation): The WMT shared tasks focus on machine translation, providing benchmarks and evaluation metrics for assessing the quality of machine translation systems across different languages.
  7. BLEU (Bilingual Evaluation Understudy): BLEU is a metric used to evaluate the quality of machine-translated text relative to human-translated reference texts. It compares n-gram overlap between the generated translation and the reference translations.
  8. ROUGE (Recall-Oriented Understudy for Gisting Evaluation): ROUGE is a set of metrics used for evaluating automatic summarization and machine translation. It measures the overlap between generated summaries or translations and reference summaries or translations.

These benchmarks and standards play a crucial role in assessing the performance and progress of large language models, helping researchers and developers understand their strengths, weaknesses, and areas for improvement.
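
The n-gram overlap behind BLEU and ROUGE, described in the list above, can be sketched in a few lines of Python. This is a simplified illustration, not a full implementation of either metric: complete BLEU also combines several n-gram orders and applies a brevity penalty, and the function names and whitespace tokenization here are assumptions made for the example.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count every contiguous n-gram in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_precision(candidate, reference, n=1):
    """BLEU-style precision: share of candidate n-grams found in the reference.

    Each candidate n-gram is credited at most as many times as it occurs
    in the reference ("clipping"), so repeating a word does not inflate the score.
    """
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
    return overlap / total


def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N-style recall: share of reference n-grams recovered by the candidate."""
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


if __name__ == "__main__":
    hypothesis = "the cat sat on the mat"
    reference = "the cat is on the mat"
    print(clipped_precision(hypothesis, reference, n=1))
    print(clipped_precision(hypothesis, reference, n=2))
    print(rouge_n_recall(hypothesis, reference, n=1))
```

Precision asks how much of the generated text is supported by the reference, while recall asks how much of the reference the generated text covers; that difference is why BLEU is associated with translation and ROUGE with summarization.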

Yann LeCun & Lex Fridman: Limits of LLMs

New topic for us; time only to cover the basics.  We have followed language, generally, however — every month — because best practice discovery and promulgation in conceiving, designing, building, occupying and maintaining the architectural character of education settlements depends upon a common vocabulary.  The struggle to agree upon vocabulary presents an outsized challenge to the work we do.

Large language models hold significant potential for the building construction industry by streamlining various processes. They can analyze vast amounts of data to aid in architectural design, structural analysis, and project management. These models can generate detailed plans, suggest optimized construction techniques, and assist in cost estimation. Moreover, they facilitate better communication among stakeholders by providing natural language interfaces for discussing complex concepts. By harnessing the power of large language models, the construction industry can enhance efficiency, reduce errors, and ultimately deliver better-designed and more cost-effective buildings.

Join us today at the usual hour.  Use the login credentials at the upper right of our home page.

Related:

print(“Python”)

Standards January: Language

Standard for Large Language Model Agent Interface

 

Speech Day

June 8, 2026
mike@standardsmichigan.com

Speech Day generally refers to an annual event held at schools in the United Kingdom, particularly private or independent schools, where students showcase their achievements and receive prizes or awards. The exact date of “Speech Day” varies by school and is typically determined by the school’s academic calendar. It is usually held either towards the end of the academic year, in the summer term before students break for the summer holidays, or early in the autumn term.

Westonbirt School

print(“Python”)

June 8, 2026
mike@standardsmichigan.com

Active Python Releases

 

“Python is the programming equivalent

of a Swiss Army Knife.”

— Some guy

 

The Python Standard Library

Open source standards development is characterized by very open exchange, collaborative participation, rapid prototyping, transparency and meritocracy.   The Python programming language is a high-level, interpreted language that is widely used for general-purpose programming. Python is known for its readability, simplicity, and ease of use, making it a popular choice for beginners and experienced developers alike.  Python has a large and active community of developers, which has led to the creation of a vast ecosystem of libraries, frameworks, and tools that can be used for a wide range of applications. These include web development, scientific computing, data analysis, machine learning, and more.

Another important aspect of Python is its versatility. It can be used on a wide range of platforms, including Windows, macOS, Linux, and even mobile devices. Python is also compatible with many other programming languages and can be integrated with other tools and technologies, making it a powerful tool for software development.  Overall, the simplicity, readability, versatility, and large community support of Python make it a valuable programming language to learn for anyone interested in software development including building automation.

Because Python (3.x) is open source software, anyone may suggest an improvement to it, starting at the link below:

Python Enhancement Proposals

Python Download for Windows

Python can be used to control building automation systems. Building automation systems are typically used to control various systems within a building, such as heating, ventilation, air conditioning, lighting, security, and more. Python can be used to control these systems by interacting with the control systems through the building’s network or other interfaces.

There are several Python libraries available that can be used for building automation, including PyVISA, which is used to communicate with instrumentation and control systems, and PyModbus, which is used to communicate with Modbus devices commonly used in building automation systems. Python can also be used to develop custom applications and scripts to automate building systems, such as scheduling temperature setpoints, turning on and off lights, and adjusting ventilation systems based on occupancy or other variables. Overall, Python’s flexibility and versatility make it well-suited for use in building automation systems.
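
As a minimal sketch of the setpoint scheduling mentioned above, the logic for choosing a heating setpoint from time of day and occupancy might look like the following. The schedule hours, the setpoint temperatures, and the function name are hypothetical values chosen for illustration. Writing the result to a real controller (for example through PyModbus) is deliberately left out, because register addresses and protocols depend on the installed equipment.

```python
from datetime import time

# Hypothetical schedule and setpoints, for illustration only.
OCCUPIED_START = time(7, 0)
OCCUPIED_END = time(18, 0)
OCCUPIED_SETPOINT_C = 21.0
SETBACK_SETPOINT_C = 16.0


def heating_setpoint(now, occupancy_detected=False):
    """Return the heating setpoint in degrees Celsius for a given time of day.

    The occupied setpoint applies during scheduled hours, or at any time
    when a sensor reports occupancy; otherwise the setback value is used.
    """
    scheduled = OCCUPIED_START <= now < OCCUPIED_END
    if scheduled or occupancy_detected:
        return OCCUPIED_SETPOINT_C
    return SETBACK_SETPOINT_C


if __name__ == "__main__":
    print(heating_setpoint(time(9, 30)))
    print(heating_setpoint(time(22, 0)))
    print(heating_setpoint(time(22, 0), occupancy_detected=True))
```

Keeping the decision logic separate from device communication makes it possible to test the schedule without touching building equipment.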

Subversion®

Building Automation & Control Networks

Gallery: Graduation Commencement Speeches

June 8, 2026
mike@standardsmichigan.com

“It is at leaving the college and entering the world that the education of youth begins…

It is less uniform than that of childhood but more dependent on chance, and doubtless more important.

The youth is then attacked by a greater number of sensations: all that surrounds him strikes him,

and strikes him forcibly.”

—  Claude-Adrien Helvétius (A Treatise on Man)

 

Constructor University (formerly Jacobs University Bremen, Germany) Graduation Band: “Freebird”

Intercollegiate Studies Institute | What Makes the West Strong (Sir Roger Scruton)

“It’s hard to think without a future.” | C.P. Snow (The Masters, 1951)

https://en.wikipedia.org/wiki/2028_Summer_Olympics

Spring Sport

June 5, 2026
mike@standardsmichigan.com

“When spring came, even the false spring,
there were no problems except where to be happiest”
Ernest Hemingway (A Moveable Feast, 1964)

University of Michigan Sailing Team | Great Lakes

We are now consolidating more than 10 years of coverage of sport standards by season.  This is our first cut at breaking the topic into four separate seasons.  Join us today at the usual hour when we sort through stabilized literature and the codes and standards open for public consultation.

Soccer 

Sports, Recreational Facilities & Equipment

Rugby

University of Michigan | Washtenaw County

Rugby

Equestrian

George M Humphrey Equestrian Center ($7M, 2004)

Cricket

Baseball

Baseball Lighting

Sport Lighting

Tennis

New Pickleball & Tennis Courts

Track and Field

University of Colorado | Boulder County

Sports Equipment & Surfaces

Swimming

Uniform Swimming Pool, Spa & Hot Tub Code

Pool, Spa & Recreational Waters

Golf

Green Space

Beach Volleyball

Volleyball Court Lighting

University of Tennessee at Chattanooga

Field Hockey

Stadium & Arena Structural Engineering

 

Sport News

June 5, 2026
mike@standardsmichigan.com

Michigan State University | Ingham County

Rocky Mountain Intercollegiate Skiing Association

College Bowl Games

Fernando Mendoza’s post game interview after winning the Big Ten
by u/justletmeregisteryou in sports

 

 

 



Michigan Girl, Our Michigan Girl….

Sport Standards

 

 

Mixed Gender Sport by Design

Engineering in Sport



“Rowing is more poetry than sport.” — George Pocock (‘Boys in the Boat’ 2024), a British-born boat builder, rowing coach, and influential figure in American rowing, best known for his craftsmanship of racing shells and his philosophical approach to the sport.

Winter Sport

“There is no greater glory for a man than that which he wins with his own hands and feet.” (Homer, Odyssey c. 8th Century BCE)

A novel smart energy management system in sports stadiums

June 5, 2026
mike@standardsmichigan.com

 

A novel smart energy management system in sports stadiums

Shady S. Refaat, et al

Texas A&M University at Qatar, Qatar Foundation, Doha, Qatar

Professional and collegiate sport venues consume huge electrical energy. Therefore, a smart management of their electric energy is essential for significant energy saving. Accordingly, this paper proposes a novel embedded real-time, smart, and active energy management system to monitor and efficiently manage such huge and typically uncontrolled energy for minimizing energy consumption and cost per day while considering spectators preferences, comfort level in behavioral modification program, and health aspects. This will provide an opportunity for spectators to reduce energy consumption and improve energy efficiency while considering healthcare concept. In addition, the proposed energy management system is equipped with embedded tools to collect and monitor energy information for each stadium’s area. The data are processed and fed to the artificial neural network algorithm that is used for managing and controlling stadium loads. This strategy does not require any change in the conventional stadium electrical panel. The proposed online algorithm yields to improve the overall grid efficiency, reliability, and increase awareness of the importance of energy conservation. Real-Time implementation of the concept is demonstrated and analyzed.


Michigan Lower Peninsula

Swimming Pool Dimensions and Construction

June 5, 2026
mike@standardsmichigan.com

University of Michigan | Washtenaw County

About Last Night: #Paris2024

A standard Olympic-sized swimming pool is defined by the following dimensions:

  • Length: 50 meters
  • Width: 25 meters
  • Depth: A minimum of 2 meters
  • Lanes: 10 lanes, each 2.5 meters wide

The total area of the pool is therefore 1,250 square meters, and it holds approximately 2,500 cubic meters (or 2.5 million liters) of water.
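
The figures above follow directly from the listed dimensions. A short Python check, assuming a uniform depth equal to the 2 meter minimum (real pools may be deeper) and using names chosen only for this example:

```python
# Dimensions from the list above; depth is assumed uniform at the stated minimum.
LENGTH_M = 50.0
WIDTH_M = 25.0
MIN_DEPTH_M = 2.0
LANES = 10
LANE_WIDTH_M = 2.5

LITERS_PER_CUBIC_METER = 1000.0


def pool_geometry(length_m, width_m, depth_m):
    """Return (surface area in m^2, volume in m^3, volume in liters)."""
    area_m2 = length_m * width_m
    volume_m3 = area_m2 * depth_m
    return area_m2, volume_m3, volume_m3 * LITERS_PER_CUBIC_METER


if __name__ == "__main__":
    area, volume, liters = pool_geometry(LENGTH_M, WIDTH_M, MIN_DEPTH_M)
    print(f"Lane widths sum to {LANES * LANE_WIDTH_M} m")
    print(f"Area: {area} m^2, volume: {volume} m^3 ({liters} liters)")
```

The ten lanes of 2.5 meters account for the full 25 meter width, and a deeper pool scales the volume proportionally.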

https://standardsmichigan.com/australia/

The organization that sets the standards for Olympic-sized pools is the Fédération Internationale de Natation (FINA) — now World Aquatics — the governing body for swimming, diving, water polo, synchronized swimming, and open water swimming. FINA establishes the regulations for the dimensions and equipment of competition pools used in international events, including the Olympic Games.

The top ten universities that have produced Olympic champions:

  1. University of Southern California (USC)
  2. Stanford University
  3. University of California, Berkeley (UC Berkeley)
  4. University of Florida
  5. University of Texas at Austin
  6. University of Michigan – Michael Phelps, the most decorated Olympian of all time.
  7. Indiana University
  8. Auburn University
  9. University of Georgia
  10. University of Arizona

News:

Swim Swam: 2024 Pool “Slow” and not setting records

Paris Olympics swimmers noticing pool is ‘slow’ 

Pool, Spa & Recreational Waters

Swimming, Water Polo and Diving Lighting

Uniform Swimming Pool, Spa & Hot Tub Code

Air Conditioning

June 4, 2026
mike@standardsmichigan.com

Ancient Air Conditioning

Today at 15:00 UTC we will review the latest in best practice literature for air conditioning systems.  Note that we have broken out this topic from the standing Mechanical colloquia.  Our approach features interoperability and system considerations.  Catalogs on the agenda:

ACCA

Air Conditioning System Construction & Maintenance

Air-Conditioning, Heating, and Refrigeration Institute

Standards and Guides

ASHRAE International

Standard 90.1-2022—Energy Standard for Sites and Buildings Except Low-Rise Residential Buildings

Standard 90.4 Energy Standard for Data Centers

Acceptable Performance Standard for District Cooling Systems

ASME

Heating, Ventilating and Air-Conditioning Systems

European Standards

EN 14511: Specifies the requirements for air conditioners, liquid chilling packages, and heat pumps with electrically driven compressors.

IEEE

Occupant-Based HVAC Thermal Setpoints

International Code Council

International Building Code Interior Environment & HVAC Systems

International Mechanical Code Chapter 11 Refrigeration

NFPA

National Electrical Code Article 430: Motors, Motor Circuits and Motor Controllers

Standard for the Installation of Air-Conditioning and Ventilating Systems

Underwriters Laboratories (largely product standards, not embedded-system or interoperability titles)

Uptime Institute

Implementing Data Center Cooling Best Practices


Use the login credentials at the upper right of our home page


University of Rochester Central Utilities Plant Absorption Chiller

Issues: [11-67, 15-124, 15-135, 15-165]

Category: Energy, Mechanical

Colleagues: Mike Anthony, Larry Spielvogel, Richard Robben


 

 

Performance Monitoring for Power Plants

June 4, 2026
mike@standardsmichigan.com


“A View of Murton Colliery near Seaham, County Durham” (1843) / John Wilson Carmichael

The American Society of Mechanical Engineers (ASME) has registered a Project Initiation Notification with ANSI to launch a revision to its consensus product ASME PM-202x, Performance Monitoring for Power Plants.  This product should interest stakeholders at colleges and universities with district energy plants: facility management staff, consulting engineers, and operations and maintenance staff.

From the project prospectus:

These Guidelines cover fossil-fueled power plants, gas-turbine power plants operating in combined cycle, and a balance-of-plant portion including interface with the steam supply system of nuclear power plants.  They include performance monitoring concepts, a description of various methods available, and means for evaluating particular applications.

Since the original publication of these Guidelines in 1993—then limited to steam power plants—the field of performance monitoring (PM) has gained considerable importance.  The lifetime of plant equipment has been improved, while economic demands have increased to extend it even further by careful monitoring.  The PM techniques themselves have also been transformed, largely by the emergence of electronic data acquisition as the dominant method of obtaining the necessary information.

These Guidelines present:

• “Fundamental Considerations”—of PM essentials prior to the actual application, so you enter fully appraised of all the requirements, potential benefits and likelihood of tradeoffs of the PM program. 

• “Program Implementation”—where the concepts of PM implementation, diagnostics and cycle interrelationships have been brought into closer conjunction, bringing you up-to-date with contemporary practice.

• “Case Studies / Diagnostic Examples”—from the large amount of experience and historical data that has been accumulated since 1993.

Intended for employees of power plants and engineers involved with all aspects of power production.

From ANSI’s PINS registry:

Project Need: This document is being developed in order to address performance monitoring and optimization techniques for different power generating facilities. The latest trends and initiatives in performance monitoring as well as practical case studies and examples will be incorporated.

Stakeholders: Designers, producers/manufacturers, owners, operators, consultants, users, general interest, laboratories, regulatory/government, and distributors.

This document will cover power generation facilities including steam generators, steam turbines, and steam turbine cycles (including balance of plant of nuclear facilities), gas turbines, and combined cycles. The guidelines include performance monitoring concepts, a description of various methods available, and means for evaluating particular applications.

No drafts are open for public consultation at this time.   The PINS announcement was placed on October 11th.   The PINS registry is a stakeholder mapping platform that identifies the beginning of a formal process that may interest other accredited, competing standards developers.   Many ASME consensus products may be indirectly referenced in design guidelines and construction contracts with the statement “Conform to all applicable codes.”

The landing page for the ASME standards development enterprise is linked below:

ASME C&S Connect

Note that you will need to set up a (free) account to access this and other ASME best practice titles.

We maintain all ASME consensus products on the standing agenda of our periodic Mechanical and Energy teleconferences.   See our CALENDAR for the next online meeting; open to everyone.

University of Michigan

Issue: [19-148]

Category: District Energy, Energy, Mechanical

Colleagues: Richard Robben, Larry Spielvogel


LEARN MORE:

ANSI Standards Action

Boiler & Pressure Vessel Code
