EUNO.NEWS EUNO.NEWS
  • All (21023) +2
  • AI (3157)
  • DevOps (933) +1
  • Software (11078)
  • IT (5806)
  • Education (48)
  • Notice
  • All (21023) +2
    • AI (3157)
    • DevOps (933) +1
    • Software (11078)
    • IT (5806)
    • Education (48)
  • Notice
  • All (21023) +2
  • AI (3157)
  • DevOps (933) +1
  • Software (11078)
  • IT (5806)
  • Education (48)
  • Notice
Sources Tags Search
한국어 English 中文
  • 1 month ago · ai

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    Overview An early version of GPT‑4 began performing tasks that previously required human effort, drawing rapid attention. It can solve math problems, write cod...

    #GPT-4 #artificial general intelligence #large language models #AI safety #emergent behavior
  • 1 month ago · ai

    Updating our Model Spec with teen protections

    OpenAI is updating its Model Spec with new Under-18 Principles that define how ChatGPT should support teens with safe, age-appropriate guidance grounded in deve...

    #OpenAI #Model Spec #teen protection #under-18 principles #AI safety #ChatGPT #developmental science #ethical AI
  • 1 month ago · ai

    VAP: A Universal Framework for AI Flight Recorders

    Airplanes Have Flight Recorders. Why Don't AI Systems? On May 6 , 2010, the Dow Jones plunged 1,000 points in minutes—erasing $1 trillion in market value. When...

    #AI provenance #flight recorder #VAP #model auditing #AI safety #transparent logging #verifiable AI
  • 1 month ago · ai

    Addendum to GPT-5.2 System Card: GPT-5.2-Codex

    This system card outlines the comprehensive safety measures implemented for GPT‑5.2-Codex. It details both model-level mitigations, such as specialized safety t...

    #GPT-5.2 #AI safety #prompt injection mitigation #sandboxing #network access control #OpenAI system card
  • 1 month ago · ai

    People Are Paying to Get Their Chatbots High on ‘Drugs’

    An online marketplace is selling code modules that simulate the effects of cannabis, ketamine, cocaine, ayahuasca, and alcohol when they are uploaded to ChatGPT...

    #ChatGPT #AI #chatbot #code modules #drug simulation #AI safety #AI misuse
  • 1 month ago · ai

    안전은 기본, 비용 절감은 덤: 별도 가드레일이 필요한 이유

    들어가며: 가드레일이 뭔가요?AI를 안전하게 사용하기 위한 여러 장치를 통틀어 보통 '가드레일guardrails'이라고 부릅니다. 자동차 주행 중 도로를 벗어나거나 옆 차선을 ......

    #AI safety #guardrails #risk management #AI governance #cost reduction
  • 1 month ago · ai

    Guardrail your LLMs

    !Forem Logohttps://media2.dev.to/dynamic/image/width=65,height=,fit=scale-down,gravity=auto,format=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%...

    #LLM #guardrails #AI safety #prompt engineering #large language models
  • 1 month ago · ai

    Harden your AI systems: Applying industry standards in the real world

    Introduction In the last article, we discussed how integrating AI into business‑critical systems opens up enterprises to a new set of risks with AI security an...

    #AI security #AI safety #industry standards #risk management #cybersecurity #Red Hat #AI governance #threat modeling
  • 1 month ago · it

    Parents call for New York governor to sign landmark AI safety bill

    A group of more than 150 parents sent a letter on Friday to New York governor Kathy Hochul, urging her to sign the Responsible AI Safety and Education RAISE Act...

    #AI safety #RAISE Act #tech policy #legislation #New York #parent advocacy #AI regulation
  • 1 month ago · ai

    Building Trustworthy AI Agents

    The promise of personal AI assistants rests on a dangerous assumption: that we can trust systems we haven’t made trustworthy. We can’t. And today’s versions are...

    #trustworthy AI #AI agents #AI safety #personal AI assistants #AI ethics #security
  • 1 month ago · ai

    Training LLMs for Honesty via Confessions

    Article URL: https://arxiv.org/abs/2512.08093 Comments URL: https://news.ycombinator.com/item?id=46242795 Points: 4 Comments: 1...

    #LLM #AI alignment #honesty #confession prompting #language model training #AI safety
  • 1 month ago · ai

    ChatGPT’s ‘adult mode’ is expected to debut in Q1 2026

    We've seen NSFW offerings from Grok and other chatbots, and OpenAI CEO Sam Altman has been teasing adult content within ChatGPT for a while. Now, we have a time...

    #ChatGPT #adult mode #OpenAI #GPT-5.2 #NSFW #AI safety #large language models

Newer posts

Older posts
EUNO.NEWS
RSS GitHub © 2026