ChatGPT 이미지 인풋

Keep Up WIth AI Revolution

by 달의 언어 2023. 10. 4. 23:44

최근에 빙 이미지 크리에이터의 성능을 실험하느라

미드저니는 뒷전으로 밀려난 느낌입니다.

오늘 ChatGPT의 이미지 인풋 기능이 활성화되어서

사용해 보았습니다.

아래는 찌그러진 주전자를 이미지 크리에이터에서 생성한 것입니다.

프롬프트를 나름 정교하게 넣어서 만들었습니다.

매끈한 주전자보다 찌그러진 주전자를 만들어내기가 훨씬 어려운 것 같습니다.

https://www.bing.com/images/create/a-time-worn-tin-kettle2c-many-dented-sides-and-a-ye/651d5422c1e04a918ae5d753b93ef178?id=4P8EZdlSsq%2f3v2Ytw9vaww%3d%3d&view=detailv2&idpp=genimg&FORM=GCRIDP&mode=overlay

Bing

Bing은 지능적인 검색 기능은 사용자가 원하는 정보를 빠르게 검색하고 보상을 제공합니다.

www.bing.com

그리고 이 이미지를 ChatGPT에 활성화된 이미지 인풋에 넣어 보았습니다.

빨간색 화살표 부분을 클릭하면 아래처럼 창이 열리고 이미지를 선택해서 업로드하면 됩니다.

그러면 놀랍게도 ChatGPT는 굉장히 자세하게 이미지에 대한 설명을 제공합니다.

The image depicts an old, rustic kettle with a rich patina, evidencing its long history and use. Its surface is dotted with intentional or perhaps naturally occurring pits and dimples, and the hue of the metal is a deep, mottled yellow-brown, suggesting the effects of oxidation over time. The kettle's handle and lid are both tarnished, further reinforcing its age and character. Positioned on a wooden surface, possibly a porch or tabletop, it's nestled amid an array of fresh green plants. These plants, characterized by their broad leaves and delicate heart-shaped foliage, bring a softness and vitality to the scene, contrasting beautifully with the kettle's rugged exterior. The ambient lighting, likely from a setting or diffused sun, casts gentle shadows and highlights, adding depth and dimension to the image. It creates an intimate, serene atmosphere, evoking feelings of nostalgia and a slower, more contemplative pace of life. The entire composition is a harmonious blend of nature and artifact, old and new, and suggests a story waiting to be told about the kettle's journey and the hands that have used it.

이 이미지는 녹청이 짙은 낡고 소박한 주전자를 묘사하고 있어 오랜 역사와 사용을 증명합니다. 주전자 표면에는 의도적으로 또는 자연적으로 생긴 구덩이와 보조개가 점점이 박혀 있으며, 금속의 색조는 깊고 얼룩덜룩한 황갈색으로 시간이 지남에 따라 산화되었음을 암시합니다. 주전자의 손잡이와 뚜껑은 모두 변색되어 있어 주전자의 나이와 개성을 더욱 강조합니다. 현관이나 탁자 위와 같은 나무 표면에 놓인 이 주전자는 신록이 우거진 식물들 사이에 자리 잡고 있습니다. 넓은 잎과 섬세한 하트 모양의 잎이 특징인 이 식물들은 주전자의 투박한 외관과 아름다운 대조를 이루며 장면에 부드러움과 활력을 불어넣어 줍니다. 석양이나 산란된 태양으로 인한 주변 조명은 부드러운 그림자와 하이라이트를 연출하여 이미지에 깊이와 입체감을 더합니다. 친밀하고 고요한 분위기를 조성하여 향수를 불러일으키고 느리고 명상적인 삶의 속도를 연상시킵니다. 전체 구성은 자연과 유물, 과거와 현재가 조화롭게 어우러져 주전자의 여정과 주전자를 사용했던 손길에 대한 이야기가 기다리고 있음을 암시합니다. (DeepL번역)

지피티가 제공한 설명을 다시 프롬프트로 활용할 수도 있습니다.

이미지 인풋하여 얻은 지피티의 설명을 다시 프롬프트로 만들어서 이미지를 생성하였다.

물론 조금 다른 이미지가 생성되었지만,

생성형 인공지능의 프롬프트를 역으로 얻어낼 수 있는 좋은 방법인 것은 틀림없습니다.

아래 글은 오픈AI에 올라온 글을 재구성한 것이다.

ChatGPT 이미지를 추가하는 기능이 활성화되었습니다.

https://help.openai.com/en/articles/8400551-image-inputs-for-chatgpt-faq

Image inputs for ChatGPT - FAQ | OpenAI Help Center

Your guide to navigating ChatGPT's new image input feature, from how to use it effectively to understanding its limitations

help.openai.com

🌈 이미지 입력 사용 방법

간단한 첫 걸음으로, 사진을 업로드하고 ChatGPT와 대화를 나눠보세요. 더 자세한 통찰력을 원하신다면, 질문을 던져보시고 대화를 더 많은 이미지로 확장해 보세요.

🌈 이미지 주석 달기

업로드하기 전에 사진 편집 마크업 도구를 사용하여 이미지의 특정 부분에 주석을 달아 ChatGPT의 초점을 조정하세요.

🌈 이미지 입력 사양 (지원되는 파일 유형)

ChatGPT는 유연하게 작동하며 PNG, JPEG, 그리고 애니메이션 없는 GIF 형식과 호환됩니다.

🌈 크기 및 수량 제한

각각의 이미지가 20MB 이하인 것을 확인하세요. 업로드에 문제가 있다면, 이미지의 크기와 수량을 재평가해 보세요.

🌈 한계

ChatGPT는 효율적이지만 제한 없는 것은 아닙니다. 그 성능과 한계를 인식하는 것이 중요합니다.

🌈 일반적인 제한사항들

일본어와 같은 비( 非 )라틴 문자나 파노라마 촬영 같은 경우처럼, ChatGPT도 어려움을 겪는 순간들이 있습니다.

🌈 의료 및 비영어 제한사항

CT 스캔과 같은 의료 영상 및 비라틴 문자가 포함된 이미지 사용은 피하세요.

🌈 시각 및 공간적 제한사항

체스 판 식별과 같은 작업들이 까다롭습니다. 회전된, 상하 반전된 영상들과 중요한 세부 정보가 잘린 경우를 피하세요.

🌈 데이터와 개인 정보 보호

당신의 개인 정보 보호 가 최우선입니다. 콘텐츠 활용 방식, 포함된 사진 등도 마찬가지입니다. 더 깊게 이해하기 위해서는 "데이터 개선 모델 성능에 어떻게 사용되는지"를 참조하세요.

🌈 결론

오늘날의 AI 환경에서, ChatGPT의 이미지 입력 은 혁신적입니다. 한계가 있지만, 그들이 제공하는 기회는 엄청납니다. 깊게 파고들고 실험해 보세요, 하지만 그 제약 사항을 인식하는 것을 잊지 마세요.

🌈 FAQ:

1. ChatGPT는 비디오를 해석할 수 있나요?

아니요, ChatGPT는 현재 정적 이미지 처리만 지원합니다.

2. 모든 플랫폼에서 이미지 입력 기능을 사용할 수 있나요?

네, 웹 및 모바일 플랫폼에서 사용 가능합니다.

3. 내가 올린 사진은 ChatGPT 훈련에 사용되나요?

일반 사용자의 이미지는 모델 성능 개선을 위해 사용될 수 있습니다. 단, ChatGPT Enterprise 콘텐츠는 훈련에 사용되지 않습니다.

4. 모델이 내 이미지를 잘못 해석하면 어떻게 해야 하나요?

이미지가 명확하게 보이도록 하고 모델의 초점을 맞추기 위해 중요한 영역에 주석을 달아보세요.

5. ChatGPT에 어떻게 이미지를 추가하나요?

GPT-4로 설정한 후 프롬프트 영역에서 + 아이콘을 탭하여 이미지 입력을 추가하세요.

Image Inputs for ChatGPT - FAQ

Dive into this comprehensive guide on ChatGPT's Image Inputs and get answers to all your questions.

Introduction

OpenAI's latest feature allows users to integrate images into their conversations with ChatGPT. But how does it benefit you?

What are Image Inputs?

Image Inputs enable ChatGPT to understand and interpret images you incorporate into your interactions. It's like providing vision to your AI companion!

Benefits of Using Image Inputs

Expanding the realm of possibilities, Image Inputs assist in discussing document contents, examining photograph details, or even analyzing visual data.

Getting Started with Image Inputs

Ready for the journey?

How to Use Image Inputs

Basic Use

Start with an easy step: Upload a photo and converse with ChatGPT. Want detailed insights? Just ask and extend your conversation with more images.

Annotating Images

Direct ChatGPT's focus by annotating specific parts of your image using a photo edit markup tool before uploading.

Image Input Specifications

Supported File Types

ChatGPT is adaptable and works with PNG, JPEG, and non-animated GIF formats.

Size and Quantity Limitations

Ensure each image is under 20MB. Issues with uploads? Re-evaluate your image's size and quantity.

Understanding Limitations

ChatGPT is efficient but not without limits. Recognize its capabilities.

Common Limitations

Whether it's handling non-Latin alphabets like Japanese or making sense of panoramic shots, ChatGPT has its moments of challenge.

Medical and Non-English Limitations

Avoid using medical images like CT scans and images with non-Latin text.

Visual and Spatial Limitations

Tasks like identifying chess boards can be tricky. Avoid rotated, upside-down images, and ensure no critical detail is cropped.

Data and Privacy

Your privacy is paramount. The approach to content usage, including images, is consistent. For a deeper understanding, consult "How your data is used to improve model performance."

Conclusion

In today's AI landscape, ChatGPT's Image Inputs are revolutionary. While they have limitations, the opportunities they offer are immense. Delve deep, experiment, but remain aware of its constraints.

FAQs:

Can ChatGPT interpret videos?
No, ChatGPT currently supports processing static images only.
Is the image input feature available on all platforms?
Yes, it's available on web and mobile platforms.
Are my images used to train ChatGPT?
Regular users' images might be used to improve model performance. ChatGPT Enterprise content isn't used for training.
What should I do if the model misinterprets my image?
Ensure clarity in your image and annotate areas for the model's focus.
How can I add images in ChatGPT?
Opt for GPT-4 and tap the + icon in the prompt area to input images.

저작자표시 비영리 변경금지 (새창열림)

'Keep Up WIth AI Revolution' 카테고리의 다른 글

생성형인공지능 혁명: 빙 이미지크리에이터와 달리3 (Dall-E 3)의 차이 (1)	2023.10.05
[ChatGPT] 생성형 AI의 혁명: 달리 3 (Dalle 3) 기능 탐구 (59)	2023.10.05
요즘 완전 대세! 크롬 확장앱 만들기 (노마드코더, traw ai 정리) (0)	2023.09.19
[미드저니] Midjourney Features - 디스코드 (2) (1)	2023.09.14
[미드저니] Midjourney Features (0)	2023.09.14

달의 언어

고정 헤더 영역

메뉴 레이어

메뉴 리스트

검색 레이어

검색 영역

상세 컨텐츠

본문 제목

본문