Logo

Publikacije (56)

Nazad
Selma Kurtović, Arman Hasanbegović, Senka Krivic

The integration of deep learning into symbolic music generation presents new opportunities for emulating artist-specific musical styles. In this paper, we propose a multi-branch Long Short-Term Memory (LSTM) network designed to generate monophonic melodies conditioned on note pitch, duration, and playback, with a focus on stylistic imitation of The Beatles. Unlike existing approaches that model music solely as sequences of pitches, our model processes three distinct streams of musical attributes and learns joint temporal dependencies through a custom architecture. We introduce a structured data representation derived from 193 MIDI files of Beatles songs using the music21 toolkit, extracting pitch and duration features and quantizing them into a format suitable for sequential prediction. Experimental results demonstrate that the model captures artist-specific musical patterns with moderate accuracy across output branches, and a listening test involving 71 participants validates the perceptual plausibility of the generated compositions. Our findings suggest that feature-aware sequence modeling is effective for stylistically informed symbolic music generation, and we discuss limitations and future extensions toward polyphonic modeling and conditional generation.

Aldin Dzelo, Adna Šestić, Andrej Gams, Senka Krivic

Manipulating deformable objects such as textiles remains a significant challenge in robotics due to their complex dynamics, unpredictable configurations, and high-dimensional state space. In this paper, we present an integrated planning and execution framework for robotic cloth manipulation that combines symbolic task planning, physics-based simulation, and vision-guided real-world control. High-level plans are generated using Planning Domain Definition Language (PDDL) and translated into low-level primitive actions executed on a Franka Emika Panda robot. To enable perceptual grounding, we employ a vision-based pipeline using an adapted CeDiRNet model for cloth corner detection and grasp point estimation. The system is first validated in IsaacSim using a particle-based deformable cloth model and then transferred to a real-world setup with consistent performance. Our results demonstrate successful Sim2Real transfer of high-level plans, enabling reliable execution of folding tasks in both simulated and physical environments.

Fatima Kaljanac, Amina Mevic, Senka Krivic

Air pollution, particularly the concentration of particulate matter ($\mathbf{P M}_{\mathbf{1 0}}$), poses significant risks to human health and the environment. In this study, we employed the FB Prophet forecasting model to predict PM10 levels in Sarajevo and applied SHAP to enhance model interpretability. Using historical meteorological data and PM10 concentration records, we evaluated the model’s performance across three prediction horizons: 30, 60, and 90 days ahead. SHAP analysis identified the key meteorological drivers influencing $\mathbf{P M}_{10}$ concentrations. Accurate long-horizon predictions can support timely planning and decision-making, while also enhancing our understanding of air pollution dynamics in Sarajevo and providing valuable guidance for environmental management and public health strategies.

Amarnath Reddy Kallam, Nikitha Edulakanti, Senka Krivic

Last-mile delivery remains a significant logistical challenge, requiring cost-effective automation to optimize routes, resource allocation, and package handling. This paper presents a low-cost automation framework for last-mile delivery that integrates a capacitated vehicle routing problem (CVRP) with dynamic pickups, 3D bin packing, and multi-modal optimization. We propose scalable algorithms that leverage computer vision for volume estimation, heuristic-based route planning, and dynamic scheduling to minimize operational costs while maximizing efficiency. Experimental results demonstrate reduced travel distances, improved load utilization, and enhanced delivery reliability. Our approach provides a practical and scalable solution for automating last-mile logistics, ensuring timely and cost-efficient deliveries.

Introductory programming courses are widely known for their difficulty among students. Success in courses is commonly measured in the form of final grades, which might not capture the challenges students face during their learning process. In this paper, we predict students’ success and their future compiler errors based on previously made errors. Furthermore, we examine the effect of applying two clustering techniques before making the predictions and identify key weeks and errors that have the greatest impact on predictions. Experimental results show that students’ compiler errors observed through the semester are an important predictor of students’ achievement and future struggles. Predictions are further improved using sentence encoder-generated embeddings with K-Means algorithm. Our study suggests that students’ errors, particularly the most recent ones, enable meaningful clustering that enhances performance prediction after only three weeks of the semester.

Understanding how students perceive and utilize Large Language Models (LLMs) and how these interactions relate to their learning behavior and individual differences is crucial for optimizing educational process and outcomes. This paper introduces a novel dataset comprising weekly self-reported data from students in an introductory programming course, i.e., students’ AI tool usage, perceived difficulty of weekly subject areas, personality traits, preferred learning styles, and general attitudes toward AI. We present a descriptive overview of the collected data and conduct a correlation analysis to gain first insights into the students’ individual differences and their learning outcomes, frequency of AI tools usage, as well as their attitudes toward AI. The findings reveal that while individual student characteristics did not show significant correlations with final performance or frequency of AI tool usage, the combination of students’ expectations for success and their perceived value of the task (constructs of expectancy theory) were significantly associated with both course outcomes and how often they used the AI tool. Additionally, motivational factors may be the key to fostering positive attitudes toward AI, while personality traits, particularly those related to negative emotionality, may play a more significant role in shaping resistance. This initial analysis lays the groundwork for future investigations on the prospects of AI in support of the students’ learning process.

This paper introduces affordance-based explanations of robot navigational decisions. The rationale behind affordance-based explanations draws on the theory of affordances, a principle rooted in ecological psychology that describes potential actions the objects in the environment offer to the robot. We demonstrate how affordances can be incorporated into visual and textual explanations for common robot navigation and path-planning scenarios. Furthermore, we formalize and categorize the concept of affordance-based explanations and connect it to existing explanation types in robotics. We present the results of a user study that shows participants to be, on average, highly satisfied with visual-textual, i.e., multimodal, affordance-based explanations of robot navigation. Furthermore, we investigate the complexity of different types of textual affordance-based explanations. Our research contributes to the expanding domain of explainable robotics, focusing on explaining robot actions in navigation.

Mouad Abrini, Omri Abend, Dina M. Acklin, H. Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja et al.

This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

Amina Mevic, Andreas Laber, S. Szedmák, Dženana Đonko, Senka Krivic

Technologies such as virtual metrology (VM), which monitors fabrication processes and predict product properties without physical measurements have numerous positive impacts. In this paper, we propose a VM system that predicts multiple physical properties of metal layers after the physical vapor deposition. We employ the Projective Selection (ProjSe) algorithm, which is suitable for variable selection in multioutput problems, to investigate the relationship between process parameters and layer properties. The effectiveness of the feature selection process combined with different regression models is demonstrated on real-world datasets collected from semiconductor manufacturer Infineon Technologies AG.

In robotics, ensuring that autonomous systems are comprehensible and accountable to users is essential for effective human-robot interaction. This paper introduces a novel approach that integrates user-centered design principles directly into the core of robot path planning processes. We propose a probabilistic framework for automated planning of explanations for robot navigation, where the preferences of different users regarding explanations are probabilistically modeled to tailor the stochasticity of the real-world human-robot interaction and the communication of decisions of the robot and its actions towards humans. This approach aims to enhance the transparency of robot path planning and adapt to diverse user explanation needs by anticipating the types of explanations that will satisfy individual users.

To bring robots into human everyday life, their capacity for social interaction must increase. One way for robots to acquire social skills is by assigning them the concept of identity. This research focuses on the concept of \textit{Explanation Identity} within the broader context of robots' roles in society, particularly their ability to interact socially and explain decisions. Explanation Identity refers to the combination of characteristics and approaches robots use to justify their actions to humans. Drawing from different technical and social disciplines, we introduce Explanation Identity as a multidisciplinary concept and discuss its importance in Human-Robot Interaction. Our theoretical framework highlights the necessity for robots to adapt their explanations to the user's context, demonstrating empathy and ethical integrity. This research emphasizes the dynamic nature of robot identity and guides the integration of explanation capabilities in social robots, aiming to improve user engagement and acceptance.

This study scrutinizes five years of Sarajevo’s Air Quality Index (AQI) data using diverse machine learning models — Fourier autoregressive integrated moving average (Fourier ARIMA), Prophet, and Long short-term memory (LSTM)—to forecast AQI levels. Focusing on various prediction frames, we evaluate model performances and identify optimal strategies for different temporal granularities. Our research unveils subtle insights into each model’s efficacy, shedding light on their strengths and limitations in predicting AQI across varied timeframes. This research presents a robust framework for automatic optimization of AQI predictions, emphasizing the influence of temporal granularity on prediction accuracy, automatically selecting the most efficient models and parameters. These insights hold significant implications for data-driven decision-making in urban air quality control, paving the way for proactive and targeted interventions to improve air quality in Sarajevo and similar urban environments.

The choices made by autonomous robots in social settings bear consequences for humans and their presumptions of robot behavior. Explanations can serve to alleviate detrimental impacts on humans and amplify their comprehension of robot decisions. We model the process of explanation generation for robot navigation as an automated planning problem considering different possible explanation attributes. Our visual and textual explanations of a robot’s navigation are influenced by the robot’s personality. Moreover, they account for different contextual, environmental, and spatial characteristics. We present the results of a user study demonstrating that users are more satisfied with multimodal than unimodal explanations. Additionally, our findings reveal low user satisfaction with explanations of a robot with extreme personality traits. In conclusion, we deliberate on potential future research directions and the associated constraints. Our work advocates for fostering socially adept and safe autonomous robot navigation.

Amar Halilovic, Vanchha Chandrayan, Senka Krivic

The decisions made by autonomous robots hold substantial influence over how humans perceive their behavior. One way to alleviate potential negative impressions of such decisions by humans and enhance human comprehension of them is through explaining. We introduce visual and textual explanations integrated into robot navigation, considering the surrounding environmental context. To gauge the effectiveness of our approach, we conducted a comprehensive user study, assessing user satisfaction across different forms of explanation representation. Our empirical findings reveal a notable discrepancy in user satisfaction, with significantly higher levels observed for explanations that adopt a multimodal format, as opposed to those relying solely on unimodal representations.

Nema pronađenih rezultata, molimo da izmjenite uslove pretrage i pokušate ponovo!

Pretplatite se na novosti o BH Akademskom Imeniku

Ova stranica koristi kolačiće da bi vam pružila najbolje iskustvo

Saznaj više