轉換範例¶
此範例說明 torchvision.transforms.v2 模組中提供的一些各種轉換。
from PIL import Image
from pathlib import Path
import matplotlib.pyplot as plt
import torch
from torchvision.transforms import v2
plt.rcParams["savefig.bbox"] = 'tight'
# if you change the seed, make sure that the randomly-applied transforms
# properly show that the image can be both transformed and *not* transformed!
torch.manual_seed(0)
# If you're trying to run that on Colab, you can download the assets and the
# helpers from https://github.com/pytorch/vision/tree/main/gallery/
from helpers import plot
orig_img = Image.open(Path('../assets') / 'astronaut.jpg')
幾何轉換¶
幾何影像轉換是指改變影像幾何屬性的過程,例如其形狀、大小、方向或位置。它涉及將數學運算應用於影像像素或座標,以實現所需的轉換。
Pad¶
Pad
轉換(另請參閱 pad()
)用一些像素值填充所有影像邊框。
padded_imgs = [v2.Pad(padding=padding)(orig_img) for padding in (3, 10, 30, 50)]
plot([orig_img] + padded_imgs)
data:image/s3,"s3://crabby-images/5796c/5796c1ae58ccf2c4d745d801e4d233e9a282a689" alt="plot transforms illustrations"
Resize¶
Resize
轉換(另請參閱 resize()
)調整影像大小。
resized_imgs = [v2.Resize(size=size)(orig_img) for size in (30, 50, 100, orig_img.size)]
plot([orig_img] + resized_imgs)
data:image/s3,"s3://crabby-images/fcb45/fcb45d6623baf784b8e8dee3f5e433b7321f213e" alt="plot transforms illustrations"
CenterCrop¶
CenterCrop
轉換(另請參閱 center_crop()
)在中心裁剪給定的影像。
center_crops = [v2.CenterCrop(size=size)(orig_img) for size in (30, 50, 100, orig_img.size)]
plot([orig_img] + center_crops)
data:image/s3,"s3://crabby-images/48833/488332612d798a0bf5e6436b9a83928cc1876b3c" alt="plot transforms illustrations"
FiveCrop¶
FiveCrop
轉換(另請參閱 five_crop()
)將給定的影像裁剪成四個角落和中心裁剪。
(top_left, top_right, bottom_left, bottom_right, center) = v2.FiveCrop(size=(100, 100))(orig_img)
plot([orig_img] + [top_left, top_right, bottom_left, bottom_right, center])
data:image/s3,"s3://crabby-images/142a3/142a3502b2c971a83c50268f983c1816ccf8a98a" alt="plot transforms illustrations"
RandomPerspective¶
RandomPerspective
轉換(另請參閱 perspective()
)對影像執行隨機透視轉換。
perspective_transformer = v2.RandomPerspective(distortion_scale=0.6, p=1.0)
perspective_imgs = [perspective_transformer(orig_img) for _ in range(4)]
plot([orig_img] + perspective_imgs)
data:image/s3,"s3://crabby-images/029b1/029b13ee5bf51c93baccb0c687b2a1fe4991b800" alt="plot transforms illustrations"
RandomRotation¶
RandomRotation
轉換(另請參閱 rotate()
)以隨機角度旋轉影像。
rotater = v2.RandomRotation(degrees=(0, 180))
rotated_imgs = [rotater(orig_img) for _ in range(4)]
plot([orig_img] + rotated_imgs)
data:image/s3,"s3://crabby-images/aa8c0/aa8c0994cc27b7a3270c8812cb20dc6df0ec4266" alt="plot transforms illustrations"
RandomAffine¶
RandomAffine
轉換(另請參閱 affine()
)對影像執行隨機仿射轉換。
affine_transfomer = v2.RandomAffine(degrees=(30, 70), translate=(0.1, 0.3), scale=(0.5, 0.75))
affine_imgs = [affine_transfomer(orig_img) for _ in range(4)]
plot([orig_img] + affine_imgs)
data:image/s3,"s3://crabby-images/b9a85/b9a85bfd80d6eba271b9f0c119345f84c5185a61" alt="plot transforms illustrations"
ElasticTransform¶
ElasticTransform
轉換(另請參閱 elastic_transform()
)隨機轉換影像中物件的形態,並產生類似看穿水的視覺效果。
elastic_transformer = v2.ElasticTransform(alpha=250.0)
transformed_imgs = [elastic_transformer(orig_img) for _ in range(2)]
plot([orig_img] + transformed_imgs)
data:image/s3,"s3://crabby-images/50fe5/50fe5731710a26577dc6b90c4891b75204a5830d" alt="plot transforms illustrations"
RandomCrop¶
RandomCrop
轉換(另請參閱 crop()
)在隨機位置裁剪影像。
cropper = v2.RandomCrop(size=(128, 128))
crops = [cropper(orig_img) for _ in range(4)]
plot([orig_img] + crops)
data:image/s3,"s3://crabby-images/f7aa8/f7aa8af7067d90ed7362dd5b0a8e3223912af09f" alt="plot transforms illustrations"
RandomResizedCrop¶
RandomResizedCrop
轉換(另請參閱 resized_crop()
)在隨機位置裁剪影像,然後將裁剪後的影像調整為給定大小。
resize_cropper = v2.RandomResizedCrop(size=(32, 32))
resized_crops = [resize_cropper(orig_img) for _ in range(4)]
plot([orig_img] + resized_crops)
data:image/s3,"s3://crabby-images/6afb6/6afb63b351054acdb6e950389e8900b7e8a182fa" alt="plot transforms illustrations"
光度轉換¶
光度影像轉換是指修改影像的光度屬性的過程,例如其亮度、對比度、色彩或色調。這些轉換的應用旨在改變影像的視覺外觀,同時保留其幾何結構。
除了 Grayscale
之外,以下轉換都是隨機的,這表示即使使用相同的轉換實例轉換給定影像,每次也會產生不同的結果。
Grayscale¶
Grayscale
轉換(另請參閱 to_grayscale()
)將影像轉換為灰階
gray_img = v2.Grayscale()(orig_img)
plot([orig_img, gray_img], cmap='gray')
data:image/s3,"s3://crabby-images/f79b8/f79b879cdfbcbbd0fb3b8adfa9e1f632f8376640" alt="plot transforms illustrations"
ColorJitter¶
ColorJitter
轉換隨機變更影像的亮度、對比度、飽和度、色調和其他屬性。
jitter = v2.ColorJitter(brightness=.5, hue=.3)
jittered_imgs = [jitter(orig_img) for _ in range(4)]
plot([orig_img] + jittered_imgs)
data:image/s3,"s3://crabby-images/ee557/ee5576fbb033bd7751407b5cdeca0d880e22a701" alt="plot transforms illustrations"
GaussianBlur¶
GaussianBlur
轉換(另請參閱 gaussian_blur()
)對影像執行高斯模糊轉換。
blurrer = v2.GaussianBlur(kernel_size=(5, 9), sigma=(0.1, 5.))
blurred_imgs = [blurrer(orig_img) for _ in range(4)]
plot([orig_img] + blurred_imgs)
data:image/s3,"s3://crabby-images/187d0/187d022b7fc2e3914dc492a92bab3f37ba296843" alt="plot transforms illustrations"
RandomInvert¶
RandomInvert
轉換(另請參閱 invert()
)隨機反轉給定影像的顏色。
inverter = v2.RandomInvert()
invertered_imgs = [inverter(orig_img) for _ in range(4)]
plot([orig_img] + invertered_imgs)
data:image/s3,"s3://crabby-images/d76d2/d76d2e6dfad8c4956a3c57979551ca6d215eb271" alt="plot transforms illustrations"
RandomPosterize¶
RandomPosterize
轉換(另請參閱 posterize()
)透過減少每個顏色通道的位元數來隨機色調分離影像。
posterizer = v2.RandomPosterize(bits=2)
posterized_imgs = [posterizer(orig_img) for _ in range(4)]
plot([orig_img] + posterized_imgs)
data:image/s3,"s3://crabby-images/f2a6e/f2a6e7e9e5b27b0dd1616070f1d9f620175a5cd2" alt="plot transforms illustrations"
RandomSolarize¶
RandomSolarize
轉換(另請參閱 solarize()
)透過反轉高於閾值的所有像素值來隨機太陽化影像。
solarizer = v2.RandomSolarize(threshold=192.0)
solarized_imgs = [solarizer(orig_img) for _ in range(4)]
plot([orig_img] + solarized_imgs)
data:image/s3,"s3://crabby-images/131d8/131d8ff7c14fda3ef8f777708d8282da2b343d40" alt="plot transforms illustrations"
RandomAdjustSharpness¶
RandomAdjustSharpness
轉換(另請參閱 adjust_sharpness()
)隨機調整給定影像的銳利度。
sharpness_adjuster = v2.RandomAdjustSharpness(sharpness_factor=2)
sharpened_imgs = [sharpness_adjuster(orig_img) for _ in range(4)]
plot([orig_img] + sharpened_imgs)
data:image/s3,"s3://crabby-images/746b3/746b3e5719d1a96b4414bbed0302c537c3eb0fae" alt="plot transforms illustrations"
RandomAutocontrast¶
RandomAutocontrast
轉換(另請參閱 autocontrast()
)隨機將自動對比應用於給定影像。
autocontraster = v2.RandomAutocontrast()
autocontrasted_imgs = [autocontraster(orig_img) for _ in range(4)]
plot([orig_img] + autocontrasted_imgs)
data:image/s3,"s3://crabby-images/d3e2b/d3e2bc15c240ae3e6da58c22630c84541e107157" alt="plot transforms illustrations"
RandomEqualize¶
RandomEqualize
轉換(另請參閱 equalize()
)隨機均衡化給定影像的直方圖。
equalizer = v2.RandomEqualize()
equalized_imgs = [equalizer(orig_img) for _ in range(4)]
plot([orig_img] + equalized_imgs)
data:image/s3,"s3://crabby-images/c2a0a/c2a0a4cfc908f7c32100ec5c3242afc8bc94d613" alt="plot transforms illustrations"
JPEG¶
JPEG
轉換(另請參閱 jpeg()
)以隨機壓縮程度將 JPEG 壓縮應用於給定影像。
data:image/s3,"s3://crabby-images/8d3ca/8d3ca499639d4a21adc3d49411a4842da1419629" alt="plot transforms illustrations"
擴增轉換¶
以下轉換是多個轉換的組合,可以是幾何轉換或光度轉換,或兩者兼具。
AutoAugment¶
AutoAugment
轉換會根據給定的自動擴增策略自動擴增資料。請參閱 AutoAugmentPolicy
以取得可用的策略。
policies = [v2.AutoAugmentPolicy.CIFAR10, v2.AutoAugmentPolicy.IMAGENET, v2.AutoAugmentPolicy.SVHN]
augmenters = [v2.AutoAugment(policy) for policy in policies]
imgs = [
[augmenter(orig_img) for _ in range(4)]
for augmenter in augmenters
]
row_title = [str(policy).split('.')[-1] for policy in policies]
plot([[orig_img] + row for row in imgs], row_title=row_title)
data:image/s3,"s3://crabby-images/9cc23/9cc2303669298f375787ad1bf0c959e696e484a1" alt="plot transforms illustrations"
RandAugment¶
RandAugment
是 AutoAugment 的替代版本。
augmenter = v2.RandAugment()
imgs = [augmenter(orig_img) for _ in range(4)]
plot([orig_img] + imgs)
data:image/s3,"s3://crabby-images/e4542/e45425e4c3fc23f4aaacd10e7bd5887755659bdd" alt="plot transforms illustrations"
TrivialAugmentWide¶
TrivialAugmentWide
是 AutoAugment 的替代實作。但是,它不是多次轉換影像,而是僅使用來自給定清單的隨機轉換和隨機強度數字來轉換影像一次。
augmenter = v2.TrivialAugmentWide()
imgs = [augmenter(orig_img) for _ in range(4)]
plot([orig_img] + imgs)
data:image/s3,"s3://crabby-images/5fcb3/5fcb3cc756a9ace0e11a387956a0f0cfdadfcf8b" alt="plot transforms illustrations"
AugMix¶
AugMix
轉換在影像的擴增版本之間進行插值。
data:image/s3,"s3://crabby-images/726dd/726ddaa7d0e44693327c4f856df2a9d3cdd93fbc" alt="plot transforms illustrations"
隨機應用轉換¶
以下轉換是以給定機率 p
隨機應用。也就是說,給定 p = 0.5
,即使使用相同的轉換實例呼叫,也有 50% 的機率傳回原始影像,以及 50% 的機率傳回轉換後的影像!
RandomHorizontalFlip¶
RandomHorizontalFlip
轉換(另請參閱 hflip()
)以給定機率對影像執行水平翻轉。
hflipper = v2.RandomHorizontalFlip(p=0.5)
transformed_imgs = [hflipper(orig_img) for _ in range(4)]
plot([orig_img] + transformed_imgs)
data:image/s3,"s3://crabby-images/d5560/d5560fb18dc0d1170f99ce3e99eeb81a02191eff" alt="plot transforms illustrations"
RandomVerticalFlip¶
RandomVerticalFlip
轉換(另請參閱 vflip()
)以給定機率對影像執行垂直翻轉。
vflipper = v2.RandomVerticalFlip(p=0.5)
transformed_imgs = [vflipper(orig_img) for _ in range(4)]
plot([orig_img] + transformed_imgs)
data:image/s3,"s3://crabby-images/9d41c/9d41ca181cdea0c4d3eefee23ab8d842a89d1cd8" alt="plot transforms illustrations"
RandomApply¶
RandomApply
轉換以給定機率隨機應用轉換清單。
applier = v2.RandomApply(transforms=[v2.RandomCrop(size=(64, 64))], p=0.5)
transformed_imgs = [applier(orig_img) for _ in range(4)]
plot([orig_img] + transformed_imgs)
data:image/s3,"s3://crabby-images/f65c6/f65c6fd979c0a6ad6d300fe027b6748d4084c906" alt="plot transforms illustrations"
指令碼的總執行時間: (0 分鐘 6.437 秒)