?url_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rft.title=Causal-Story%3A+Local+Causal+Attention+Utilizing+Parameter-Efficient+Tuning+for+Visual+Story+Synthesis&rft.creator=Song%2C+Tianyi&rft.creator=Cao%2C+Jiuxin&rft.creator=Wang%2C+Kun&rft.creator=Liu%2C+Bo&rft.creator=Zhang%2C+Xiaofeng&rft.description=The+excellent+text-to-image+synthesis+capability+of+diffusion+models+has+driven+progress+in+synthesizing+coherent+visual+stories.+The+current+state-of-the-art+method+combines+the+features+of+historical+captions%2C+historical+frames%2C+and+the+current+captions+as+conditions+for+generating+the+current+frame.+However%2C+this+method+treats+each+historical+frame+and+caption+as+the+same+contribution.+It+connects+them+in+order+with+equal+weights%2C+ignoring+that+not+all+historical+conditions+are+associated+with+the+generation+of+the+current+frame.+To+address+this+issue%2C+we+propose+Causal-Story.+This+model+incorporates+a+local+causal+attention+mechanism+that+considers+the+causal+relationship+between+previous+captions%2C+frames%2C+and+current+captions.+By+assigning+weights+based+on+this+relationship%2C+Causal-Story+generates+the+current+frame%2C+thereby+improving+the+global+consistency+of+story+generation.+We+evaluated+our+model+on+the+PororoSV+and+FlintstonesSV+datasets+and+obtained+state-of-the-art+FID+scores%2C+and+the+generated+frames+also+demonstrate+better+storytelling+in+visuals.&rft.subject=Training%2C+Image+quality%2C+Visualization%2C+Coherence%2C+Signal+processing%2C+Acoustics%2C+Speech+processing&rft.publisher=IEEE&rft.date=2024-03-18&rft.type=Proceedings+paper&rft.language=eng&rft.source=+++++In%3A++ICASSP+2024+-+2024+IEEE+International+Conference+on+Acoustics%2C+Speech+and+Signal+Processing+(ICASSP).++(pp.+pp.+3350-3354).++IEEE%3A+Seoul%2C+Korea%2C+Republic+of.+(2024)+++++&rft.format=application%2Fpdf&rft.identifier=https%3A%2F%2Fdiscovery.ucl.ac.uk%2Fid%2Feprint%2F10199974%2F1%2FSong_2309.09553v4.pdf&rft.identifier=https%3A%2F%2Fdiscovery.ucl.ac.uk%2Fid%2Feprint%2F10199974%2F&rft.rights=open