Distributed Representations of Words and Phrases

Loading
loading...

Distributed Representations of Words and Phrases

June 9, 2026
mike@standardsmichigan.com
No Comments

 

Tomas Mikolov, et. al
Google Inc. Mountain View

Abstract.  The recently introduced continuous Skip-gram model is an efficient method for learning high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships. In this paper we present several extensions that improve both the quality of the vectors and the training speed. By subsampling of the frequent words we obtain significant speedup and also learn more regular word representations. We also describe a simple alternative to the hierarchical softmax called negative sampling.

An inherent limitation of word representations is their indifference to word order and their inability to represent idiomatic phrases. For example, the meanings of “Canada” and “Air” cannot be easily combined to obtain “Air Canada”. Motivated by this example, we present a simple method for finding phrases in text, and show that learning good vector representations for millions of phrases is possible.


 

Large Language Models and Infrastructure Technical Standards

Large Language Models (LLMs) are poised to significantly accelerate and reshape the development of infrastructure standards — including engineering codes, technical specifications for civil works, transportation, energy grids, water systems, and related Standards Development Organization (SDO) processes at ASTM, IEEE, ASABE, ISO, and similar bodies.  This connection traces back to foundational ideas in distributed representations (Hinton et al., Mikolov’s Word2Vec) that powered the transformer revolution, which in turn enabled modern LLMs and the shift from passive generative AI to active, goal-directed agentic AI.

While LLMs will not replace human expertise, consensus-building, or rigorous validation, they will transform traditionally slow, document-heavy workflows into faster, more collaborative, and data-driven processes.

1. Faster Drafting, Summarization, and Gap Analysis

LLMs can rapidly summarize lengthy documents, extract key requirements, identify inconsistencies across related standards, and generate initial draft sections or comparison tables. This is especially valuable for reviewing historical codes, research papers, regulations, and stakeholder inputs.

Infrastructure example: In renewable energy permitting or grid interconnection standards, LLMs excel at processing complex environmental impact statements and regulatory texts to accelerate reviews.

2. Enhanced Requirements Engineering and Consistency Checking

LLMs support formal requirements extraction, flag ambiguities, suggest measurable criteria, and translate between domains. They help maintain alignment between textual standards and digital implementations such as Building Information Modeling (BIM) or simulation tools.

3. Improved Accessibility, Education, and Stakeholder Participation

LLMs make standards more usable by generating plain-language explanations, FAQs, examples, and tailored training materials. They lower barriers for broader participation in SDO committees by helping non-experts understand and contribute to drafts.

4. Domain-Specific Applications in Infrastructure

  • Civil, Structural & Agricultural Engineering: Design ideation, safety analysis, and updating standards for new materials and climate resilience.
  • Permitting & Compliance: Summarizing environmental documents and speeding up infrastructure deployment.
  • Interoperability & Testing: Verification support for software-heavy systems such as smart grids and autonomous infrastructure.

5. Broader Process Changes for SDOs

  • Zero-draft acceleration for preliminary stakeholder review
  • Continuous monitoring for maintenance and timely updates
  • Multi-agent LLM systems for parallel virtual expert review before human consensus

Limitations and Important Caveats

  • “Hallucinations” & Validation: Outputs must always be human-verified, especially in safety-critical areas. Domain-specific fine-tuning and retrieval-augmented generation (RAG) help but are not foolproof.
  • Bias, Copyright & Accountability: Standards demand traceability and consensus; LLMs can introduce subtle biases or IP concerns.
  • Not a Full Replacement: Human judgment remains essential for risk assessment, ethics, and real-world tradeoffs.

Expect 2–5× faster iteration on drafts, superior knowledge management, and more adaptive standards. Early adopters using LLM assisted tools with proper governance will lead the next generation of infrastructure standards development.

Hegemon Fairfield County Connecticut

June 9, 2026
mike@standardsmichigan.com
No Comments

Hubbell Corporation, a leader in electrical and utility solutions, significantly contributes to data center build-outs by providing end-to-end infrastructure products. These include reliable connectivity, structured cabling, wiring devices, enclosures, and modular prefabricated systems for high-density server rooms and power distribution. Through brands like PCX and Hubbell Premise Wiring, it ensures scalability, maximum uptime, and regulatory compliance, backed by a 25-year guarantee. Amid AI-driven demands, Hubbell’s vertically integrated approach supports efficient grid-to-chip power management, enabling faster, resilient expansions for colocation and enterprise facilities.

 

 

Cornbread & Grandma’s Chicken Soup

June 8, 2026
mike@standardsmichigan.com
,
No Comments

Standards Nebraska | Statement of Net Position: $5.191B (Page 26)

WRITTEN BY Kalani Simpson PUBLISHED May 25, 2021

 

Ingredients:

  • 1 5- to 6-pound stewing hen or baking chicken
  • 1 package of chicken wings
  • 3 large onions
  • 1 large sweet potato
  • 3 parsnips
  • 2 turnips
  • 11 to 12 large carrots
  • 5 to 6 celery stems
  • 1 bunch of parsley
  • Salt and pepper to taste

Directions:

  1. Clean the chicken, put it in a large pot and cover it with cold water. Bring the water to boil.
  2. Add the chicken wings, onions, sweet potato, parsnips, turnips and carrots. Boil about 1 and a half hours. Remove fat from the surface as it accumulates.
  3. Add the parsley and celery. Cook the mixture about 45 min. longer.
  4. Remove the chicken. The chicken is not used further for the soup. (The meat makes excellent chicken parmesan.)
  5. Put the vegetables in a food processor until they are chopped fine or pass through a strainer. Both were performed in the present study.
  6. Add salt and pepper to taste.

(Note: This soup freezes well.)  Matzo balls were prepared according to the recipe on the back of the box of matzo meal (Manischewitz).

PRINT Recipe

b

 

Cornbread & Coffee

Large Language Model Standards

June 8, 2026
mike@standardsmichigan.com

No Comments

Perhaps the World Ends Here | Joy Harjo

 

The world begins at a kitchen table. No matter what, we must eat to live.
The gifts of earth are brought and prepared, set on the table.
So it has been since creation, and it will go on.
We chase chickens or dogs away from it. Babies teethe at the corners. They scrape their knees under it.
It is here that children are given instructions on what it means to be human.
We make men at it, we make women.
At this table we gossip, recall enemies and the ghosts of lovers.
Our dreams drink coffee with us as they put their arms around our children.
They laugh with us at our poor falling-down selves and as we put ourselves back together once again at the table.
This table has been a house in the rain, an umbrella in the sun.
Wars have begun and ended at this table. It is a place to hide in the shadow of terror.
A place to celebrate the terrible victory.
We have given birth on this table, and have prepared our parents for burial here.
At this table we sing with joy, with sorrow. We pray of suffering and remorse. We give thanks.
Perhaps the world will end at the kitchen table, while we are laughing and crying, eating of the last sweet bite.

 

Standards and benchmarks for evaluating large language models (LLMs). Some of the most commonly used benchmarks and standards include:

  1. GLUE (General Language Understanding Evaluation): GLUE is a benchmark designed to evaluate and analyze the performance of models across a diverse range of natural language understanding tasks, such as text classification, sentiment analysis, and question answering.
  2. SuperGLUE: SuperGLUE is an extension of the GLUE benchmark, featuring more difficult language understanding tasks, aiming to provide a more challenging evaluation for models.
  3. CoNLL (Conference on Computational Natural Language Learning): CoNLL has historically hosted shared tasks, including tasks related to coreference resolution, dependency parsing, and other syntactic and semantic tasks.
  4. SQuAD (Stanford Question Answering Dataset): SQuAD is a benchmark dataset for evaluating the performance of question answering systems. It consists of questions posed on a set of Wikipedia articles, where the model is tasked with providing answers based on the provided context.
  5. RACE (Reading Comprehension from Examinations): RACE is a dataset designed to evaluate reading comprehension models. It consists of English exam-style reading comprehension passages and accompanying multiple-choice questions.
  6. WMT (Workshop on Machine Translation): The WMT shared tasks focus on machine translation, providing benchmarks and evaluation metrics for assessing the quality of machine translation systems across different languages.
  7. BLEU (Bilingual Evaluation Understudy): BLEU is a metric used to evaluate the quality of machine-translated text relative to human-translated reference texts. It compares n-gram overlap between the generated translation and the reference translations.
  8. ROUGE (Recall-Oriented Understudy for Gisting Evaluation): ROUGE is a set of metrics used for evaluating automatic summarization and machine translation. It measures the overlap between generated summaries or translations and reference summaries or translations.

These benchmarks and standards play a crucial role in assessing the performance and progress of large language models, helping researchers and developers understand their strengths, weaknesses, and areas for improvement.

Yann Lecun & Lex Fridman: Limits of LLMs

New topic for us; time only to cover the basics.  We have followed language, generally, however — every month — because best practice discovery and promulgation in conceiving, designing, building, occupying and maintaining the architectural character of education settlements depends upon a common vocabulary.  The struggle to agree upon vocabulary presents an outsized challenge to the work we do.

Large language models hold significant potential for the building construction industry by streamlining various processes. They can analyze vast amounts of data to aid in architectural design, structural analysis, and project management. These models can generate detailed plans, suggest optimized construction techniques, and assist in cost estimation. Moreover, they facilitate better communication among stakeholders by providing natural language interfaces for discussing complex concepts. By harnessing the power of large language models, the construction industry can enhance efficiency, reduce errors, and ultimately deliver better-designed and more cost-effective buildings.

Join us today at the usual hour.  Use the login credentials at the upper right of our home page.

Related:

print(“Python”)

Standards January: Language

Standard for Large Language Model Agent Interface

 

Speech Day

June 8, 2026
mike@standardsmichigan.com
, , , ,
No Comments

Speech Day generally refers to an annual event held at schools in the United Kingdom, particularly private or independent schools, where students showcase their achievements and receive prizes or awards. The exact date of “Speech Day” varies by school and is typically determined by the school’s academic calendar. It is usually held towards the end of the academic year, either in the summer term or in the early autumn term, before students break for the summer holidays.

Westonbirt School

Gallery: Graduation Commencement Speeches

June 8, 2026
mike@standardsmichigan.com
, , , , , , , , ,
No Comments

“It is at leaving the college and entering the world that the education of youth begins…

It is less uniform than that of childhood but more dependent on chance, and doubtless more important.

The youth is then attacked by a greater number of sensations: all that surrounds him strikes him,

and strikes him forcibly.”

—  Claude-Adrien Helvétius (A Treatise on Man)

 

Constructor University (formerly, Jacobs University Bremen Germany) Graduation Band: “Freebird”

Intercollegiate Studies Institute | What Makes the West Strong (Sir Roger Scruton)

Jerry Seinfeld @ Duke University 2026

 

“It’s hard to think without a future.” | C.P. Snow (The Masters, 1951)

HVDC

June 7, 2026
mike@standardsmichigan.com

No Comments

This content is accessible to paid subscribers. To view it please enter your password below or send mike@standardsmichigan.com a request for subscription details.

https://en.wikipedia.org/wiki/2028_Summer_Olympics

Spring Sport

June 5, 2026
mike@standardsmichigan.com
No Comments

“When spring came, even the false spring,
there were no problems except where to be happiest”
Ernest Hemingway (A Moveable Feast, 1964)

University of Michigan Sailing Team | Great Lakes

We are consolidating over 10+ years of coverage of sport standards by the season now.  This is our first cut breaking the topic into four separate seasons.  Join us today at the usual hour when we sort through stabilized literature and the codes and standards open for public consultation

Soccer 

Sports, Recreational Facilities & Equipment

Rugby

University of Michigan | Washtenaw County

Rugby

Equestrian

George M Humphrey Equestrian Center ($7M, 2004)

Cricket

Baseball

Baseball Lighting

Sport Lighting

Tennis

New Pickleball & Tennis Courts

Track and Field

University of Colorado | Boulder County

Sports Equipment & Surfaces

Swimming

Uniform Swimming Pool, Spa & Hot Tub Code

Pool, Spa & Recreational Waters

Golf

Green Space

Beach Volleyball

Volleyball Court Lighting

University of Tennessee at Chattanooga

Field Hockey

Stadium & Arena Structural Engineering

 

A novel smart energy management system in sports stadiums

June 5, 2026
mike@standardsmichigan.com
,
No Comments

 

A novel smart energy management system in sports stadiums

Shady S. Refaat, et al

Texas A&M University at Qatar, Qatar Foundation, Doha, Qatar

Professional and collegiate sport venues consume huge electrical energy. Therefore, a smart management of their electric energy is essential for significant energy saving. Accordingly, this paper proposes a novel embedded real-time, smart, and active energy management system to monitor and efficiently manage such huge and typically uncontrolled energy for minimizing energy consumption and cost per day while considering spectators preferences, comfort level in behavioral modification program, and health aspects. This will provide an opportunity for spectators to reduce energy consumption and improve energy efficiency while considering healthcare concept. In addition, the proposed energy management system is equipped with embedded tools to collect and monitor energy information for each stadium’s area. The data are processed and fed to the artificial neural network algorithm that is used for managing and controlling stadium loads. This strategy does not require any change in the conventional stadium electrical panel. The proposed online algorithm yields to improve the overall grid efficiency, reliability, and increase awareness of the importance of energy conservation. Real-Time implementation of the concept is demonstrated and analyzed.


Michigan Lower Peninsula

Swimming Pool Dimensions and Construction

June 5, 2026
mike@standardsmichigan.com
,
No Comments

University of Michigan | Washtenaw County

About Last Night: #Paris2024

A standard Olympic-sized swimming pool is defined by the following dimensions:

  • Length: 50 meters
  • Width: 25 meters
  • Depth: A minimum of 2 meters
  • Lanes: 10 lanes, each 2.5 meters wide

The total area of the pool is therefore 1,250 square meters, and it holds approximately 2,500 cubic meters (or 2.5 million liters) of water.

https://standardsmichigan.com/australia/

The organization that sets the standards for Olympic-sized pools is the Fédération Internationale de Natation (FINA) — now World Aquatics — the governing body for swimming, diving, water polo, synchronized swimming, and open water swimming. FINA establishes the regulations for the dimensions and equipment of competition pools used in international events, including the Olympic Games.

The top ten universities that have produced Olympic champion:

  1. University of Southern California (USC)
  2. Stanford University
  3. University of California, Berkeley (UC Berkeley)
  4. University of Florida
  5. University of Texas at Austin
  6. University of Michigan – Michael Phelps, the most decorated Olympian of all time.
  7. Indiana University
  8. Auburn University
  9. University of Georgia
  10. University of Arizona

News:

Swim Swam: 2024 Pool “Slow” and not setting records

Paris Olympics swimmers noticing pool is ‘slow’ 

Pool, Spa & Recreational Waters

Swimming, Water Polo and Diving Lighting

Uniform Swimming Pool, Spa & Hot Tub Code

Layout mode
Predefined Skins
Custom Colors
Choose your skin color
Patterns Background
Images Background
Standards Michigan
error: Content is protected !!
Skip to content